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Abstract. Pseudomonas aureofaciens grown on octanoate or gluconate synthesized medium-chain- 
length polyhydroxyalkanoates (mcl-PHAs). To clone the PHA synthase gene(s) (phaC) y the genomic 
library of P. aureofaciens was constructed using a cosmid vector. The recombinant cosmids that clone 
phaC were detected by the complementation with a PHA-negative mutant, P, putida GPpl04. The 
resulting recombinant cosmid, named pVK6, contained a 13-kbp DNA insert. Genetic analysis of the pha 
locus in pVK6 revealed the presence of six ORFs, genes encoding two PHA synthases, 1 and 2 {phaCl 
and phaC2), PHA depolymerase (phaZ), two PHA granule-associated proteins [phaF and phal), and an 
unknown protein (phaD). The heterologous expression of pha genes from P. aureofaciens was confirmed. 
P. putida GPpl04 regained the ability to accumulate PHA on introduction of pVK6. Wild-type strains P. 
oleovorans and P. fluorescens, which were unable to accumulate PHA when grown on gluconate, 
acquired the ability to accumulate PHA from gluconate when they possessed pVK6. 



PHAs are storage polymers accumulated by bacteria 
during nutrient-limited conditions [2], and are of indus- 
trial interest as a material for biodegradable thermoplas- 
tic [8]. PHAs are divided into two groups based on their 
monomer chain length; that is, short-chain-length PHAs 
(scl-PHAs, C3 to C5) and medium-chain-length PHAs 
(mcl-PHAs, C6 to CI 4). scl-PHA production by Ralsto- 
nia eutropha (formerly Alcaligenes eutrophus) has been 
extensively studied [11]. Polyhydroxybutyrate (PHB), a 
homopolymer of (R)-3-hydroxybutyrate, is the most in- 
vestigated scl-PHA. PHB production has already been 
developed using recombinant Escherichia coli carrying 
pha genes from R. eutropha, in which high amounts of 
PHB (more than 90% of the cell dry weight [CDW]) are 
synthesized and the range of carbon sources used as 
substrates is being extended [1, 7]. 

Pseudomonas strains synthesize mcl-PHA [15]. For 
example, P. putida [3, 4] and P. aeruginosa [16] can 
accumulate mcl-PHA not only when grown on alkanes or 
alkanoic acids but also when grown on unrelated sub- 
strates such as gluconate, fructose, and glucose. Com- 
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paring PHB with mcl-PHA, mcl-PHA is more useful 
because PHB is highly crystalline and brittle and the 
technical applications are limited. Moreover, mcl-PHA 
contains a variety of monomers and has versatile char- 
acteristics. Despite these factors, studies using recombi- 
nant E. coli as performed for PHB production have not 
yet been carried out for mcl-PHA production. More 
research is needed for development of mcl-PHA produc- 
tion. In this study, the results with regards to the com- 
position of PHA, genetic analysis and heterologous ex- 
pression of the pha genes in P. aureofaciens are 
described. 

Materials and Methods 

Bacterial strains and cultivation. Bacterial strains used in this study 
are P. aureofaciens IF03521, P. putida GPpl04 [5], P. oleovorans 
ATCC29347, and P. jiuorescens DSM50090. For PHA accumulation, 
bacterial strains were cultivated in a mineral salt medium containing 
0.05% (wt/vol) NH 4 CI as a nitrogen source [10] at 30°C for 72 h. 
Substrates were added at the final concentrations of 0.5% (wt/vol) for 
octanoate (0.1% [wt/vol], five times) and 1.5% (wt/vol) for gluconate, 
glucose, and fructose. 

Cloning of PHA synthase gene {phaQ. The genomic library of P. 
aureofaciens was constructed using the broad-host-range cosmid vec- 
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Table 1. PHA accumulation in P. aureofaciens 



Substrate 


PHA content 
(% [wt/wt] of 
CDW*) 






Composition of PHA (mol%)" 






3HB 


3HX 


3HO 


3HD 


3HDD 


3HDD:1 


Octanoate 


45.0 


0 


12 


82 


6 


0 


0 


Gluconate 


31.7 


0 


2 


12 


55 


17 


14 


Glucose 


32.2 


0 


4 


15 


47 


17 


17 


Fructose 


29.1 


0 


4 


18 


48 


16 


13 



a 3HB, 3-hydroxybutyrate; 3HX, 3-hydroxyhexanoate; 3HO, 3-hydroxyoctanoate; 3HD, 3-hydroxydecanoate; 3HDD, 3-hydroxydodecanoate; 
3HDD:1, 3-hydroxydodecenoate. 
"CDW, cell dry weight. 




Fig. I . Detection of C-encoded fragment. (A) aga- 
rose gel electrophoresis. Cosmid DNAs digested with 
HindlU were charged. (B) Southern hybridization. 
phaCl from P. oleovorans was used as a probe. Lane 
V: cosmid vector pVK102; lanes 1, 2, and 3: recom- 
binant cosmids pVK2, pVK4, and pVK6, respec- 
tively. 




tor, pVK102 [6]. The DNA fragment partially-digested with HM\\\ 
was inserted to the cloning site. The recombinant cosmids that clone 
phaCv/QTC detected by the complementation with P. putida GPpl04, a 
PHA-negative mutant of P. putida KT2442. 

DNA sequencing was carried out by the dideoxychain termination 
method as described by Sanger et al. [13] using the Automatic Se- 
quencer DSQ-2000 L (Shimadzu). The sequencing reaction was per- 
formed in accordance with the manual supplied with DNA sequencing 
kit, the Big Dye Terminator Cycle Sequencing Ready Reaction (Perkin 
Elmer Biosystems). 

Southern hybridization was conducted using phaCl from P. oleo- 
vorans as a probe. The labeled probe was prepared using the ECF 
Random Prime Labeling Kit (Amersham Pharmacia Biotech) and de- 
tection of hybridization signals on membrane was carried out using 
Fluorolmager 595 (Molecular Dynamics). 

Analysis of PHA. PHAs were extracted from lyophilized cells with 
chloroform and then methanolyzed for analysis by gas chromatography 



[15]. The methyl esters of monomers were assayed using the Gas 
Chromatograph GC-12 (Shimadzu) and a column packed with Ther- 
mon-3000 supported on 60/80 Shincarbon (Shinwakakou) as the sta- 
tionary phase. 

Nucleotide sequence accession number. The nucleotide sequence 
data are registered in the DDBJ nucleotide sequence database under the 
accession number AB049413. 



Results and Discussion 

PHA accumulation in P. aureofaciens. P. aureofaciens 
was cultivated on octanoate, gluconate, fructose, or glu- 
cose as the sole carbon source. The content and compo- 
sition of accumulated PHA using these substrates in P. 
aureofaciens are summarized in Table 1. mcl-PHA ac- 
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Fig. 2. pha locus in P. aureofaciens. Proteins 
(genes): unknown 1 (ORFl), PHA synthase I 
{phaCl\ PHA depolymerase {phaZ), PHA synthase 
2 {phaC2\ unknown 2 {phaD), and PHA granule- 
associated proteins (phaF and phaf). Arrows show 
the direction of transcription. H, HindlU. 



Table 2. Heterologous expression of pha genes from P. aureofaciens 



Strain (cosmid carried) 


Substrate 


PHA content 
(% [wt/wt] of CDW") 


3HB 


3HX 


Composition of PHA (mol%f 
3HO 3HD 3HDD 


3HDD:1 


P. putida GPp\04 (pVK102) 


Gluconate 


0 


0 


0 


0 


0 


0 


0 


P. putida GPpl04 (pVK6) 


Gluconate 


12.1 


0 


0 


8 


54 


20 


17 


P. putida GPpl04 (pVK.102) 


Octanoate 


0 


0 


0 


0 


0 


0 


0 


P. putida 104 (pVK6) 


Octanoate 


22.4 


0 


6 


86 


5 


2 


0 


P. oleovorans (pVK102) 


Gluconate 


0 


0 


0 


0 


0 


0 


0 


P. oleovorans (pVK6) 


Gluconate 


7.5 


0 


0 


24 


54 


15 


7 


P. oleovorans (pVK 1 02) 


Octanoate 


40.3 


0 


8 


87 


5 


0 


0 


P. oleovorans (pVK6) 


Octanoate 


48.6 


0 


7 


89 


2 


0 


0 



Both strains of P. putida GPpl04 and P. oleovorans each carrying a cosmid were used. 

pVKI02, broad-host-range cosmid vector; pRlC6, pVK102 cloning pha genes from P. aureofaciens, 

a 3HB, 3-hydroxybutyrate; 3HX ; 3-hydroxyhexanoate; 3HO, 3-hydroxy octanoate; 3HD. 3-hydroxydecanoate; 3HDD, 3-hydroxydodecanoate; 
3HDD:1, 3-hydroxydodecenoate. 
b CDW, cell dry weight 



cumulated approximately 45% of CDW in the presence 
of a related carbon source (octanoate), and approxi- 
mately 30% of the CDW in the presence of unrelated 
carbon source (gluconate, glucose, or fructose). Accu- 
mulated PHA with gluconate, glucose, or fructose as the 
sole carbon source mainly consisted of 3-hydroxyde- 
canoate (3HD), 3-hydroxydodecanoate (3 HDD), and 
3-hydroxydodecenoate (3HDD:1). Accumulated PHA in 
P. aureofaciens did not include any 3 -hydroxy buty rate 
(3HB) as a monomer. 

Genetic analysis of pha genes. Seven recombinant cos- 
mids, which conferred the PHA synthesizing ability to 
the PHA-negative mutant, P. putida GPpl04, were iso- 
lated. Each recombinant cosmid containing a partially 
digested HindlU fragment in the cloning site was com- 
pletely redigested with HindlU. The digestion pattern 
was classified into three types that correspond to recom- 
binant cosmids pVK2, pVIC4, and pVK6 (Fig. 1). The 
recombinant cosmids had 6.5-kbp, 4.3-kbp, and 2.3-kbp 
HindlU fragments in common as shown in Fig. 1A. 
Southern hybridization was carried out using phaCl 
from P. oleovorans as a probe. Results showed that phaC 
from P. aureofaciens was encoded in the 6.5-kbp DNA 
fragment (Fig. IB). 



About an 8-kbp area of the 6.5-kbp and flanking 
4.3-kbp fragments was sequenced. Genetic analysis re- 
vealed the presence of seven ORFs (Fig. 2). The pha 
locus contained genes encoding two PHA synthases, 1 
and 2 (phaCl and phaC2), PHA depolymerase (phaZ), 
an unknown protein (phaD), and two PHA granule- 
associated proteins (phaF and phaf). A homologous pha 
cluster showing a similar gene organization has been 
found in mcl-PHA-synthesizing bacteria P. aeruginosa 
[16] and P. oleovorans [5, 12]. 

Heterologous expression of pha genes from P. aureo- 
faciens. As described in the Materials and Methods 
section, the genomic library was introduced into P. 
putida GPpl04 to detect recombinant cosmids that clone 
phaC. Recombinant, P. putida GPpl04 cells carrying 
phaC, were selected by their white colony formation due 
to PHA accumulation on an octanoate plate. Here, PHA 
accumulation was tested using recombinant P. putida 
GPpl04 carrying pVK6, which contained the smallest 
DNA insert of 13 kbp. The recombinant accumulated 
PHA when grown on octanoate or gluconate as shown in 
Table 2. These results show that phaCl and/or phaC2 
were/ was expressed in P. putida GPpl04. 

P. oleovorans accumulates PHA when grown on 
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octanoate but not when grown on gluconate [5]. Glu- 
conate-grown cells of P. oleovorans carrying pVK.6 ac- 
cumulated PHA and octanoate-grown cells had an in- 
creased PHA content as shown in Table 2. These results 
show that the pha genes from P. aureofaciens were 
expressed in P. oleovorans. Similar results as obtained in 
P. oleovorans were shown in P. fluorescens. P. fluores- 
cens do not accumulate any detectable PHA when grown 
on gluconate [15]. P. fluorescens also accumulated PHA 
at approximately 5% of CDW using gluconate on intro- 
duction of pVK6 (data not shown). 

Much attention is given to PHA production using 
renewable sources such as carbon dioxide and industrial 
and agricultural wastes [1, 7, 9]. Cyanobacteria are 
promising hosts for PHA production from carbon diox- 
ide because they can grow using carbon dioxide as the 
sole carbon source. PHA production by cyanobacteria 
has been studied using both isolates from natural habitat 
and recombinant cyanobacteria carrying pha genes intro- 
duced from R. eutropha [9, 14]. E. coli is another prom- 
ising host. E. coli normally does not accumulate PHA. 
Furthermore, PHA produced by recombinants with in- 
troduced pha genes does not decompose since E. coli 
does not carry PHA depolymerase. PHB production has 
already been successfully carried out using cyanobacte- 
ria [9, 14] and E. coli [1, 7] as described in the introduc- 
tion. For practical applications, flexible mcl-PHAs are 
generally considered to be more useful than brittle PHB. 
However, studies of mcl-PHA production by cyanobac- 
teria and E. coli have hardly progressed, pha genes from 
P. aureofaciens will be useful for the mcl-PHA produc- 
tion using renewable sources as substrates. 
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In the enteric bacterium, Escherichia coli, acyl coen- 
zyme A synthetase (fatty acid:CoA ligase (AMP-form- 
ing) EC 6.2.1.3) activates exogenous long-chain fatty 
acids concomitant with their transport across the inner 
membrane into metabolically active CoA thioesters. 
These compounds serve as substrates for acyl-CoA de- 
hydrogenase in the first step in the process of 0-oxi- 
dation. The acyl-CoA synthetase structural gene, fadD, 
has been identified on clone 6D1 of the Kohara E. coli 
gene library and by a process of subcloning and com- 
plementation analyses shown to be contained on a 2.2- 
kilobase Ncol-Clal fragment of genomic DNA. The 
polypeptide encoded within this DNA fragment was 
identified following T7 UNA polymerase-dependent 
induction and estimated to be Af r = 62,000 using SDS- 
polyacrylamide gel electrophoresis. The N-terminal 
amino acid sequence of acyl-Co A synthetase was deter- 
mined by automated sequencing to be Met-Lys-Lys- 
Val-Trp-Leu-Asn-Arg-Tyr-Pro. Sequence analysis of 
the 2.2-kilobase Ncol-Clal fragment revealed a single 
open reading frame encoding these amino acids as the 
first 10 residues of a protein with a molecular weight 
of 62,028. The initiation codon for methionine was 
TTG. Primer extension of total in vivo mRNA from 
two fadD-specific oligonucleotides defined the tran- 
scriptional start at an adenine residue 60 base pairs 
upstream from the predicted translations! start site. 
Two FadR operator sites of the fadD gene were iden- 
tified at positions -13 to -29 (0 D i) and positions -99 
to -115 (Ods) by DNase I footprinting. Comparisons of 
the predicted amino acid sequence of the E. coli acyl- 
CoA synthetase to the deduced amino acid sequences of 
the rat and yeast acyl-CoA synthetases and the firefly 
luciferase demonstrated that these enzymes shared a 
significant degree of similarity. Based on the similar 
reaction mechanisms of these four enzymes, this simi- 
larity may define a region required for the same func- 
tion. 



Exogenous long-chain fatty acids (Ci 2 -Ci 8 ) represent an 
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important class of hydrophobic compounds that can serve as 
a sole carbon and energy source to support the growth of the 
enteric bacterium Escherichia colu The acquisition of these 
nutrients from the environment prior to metabolic utilization 
by cyclic 0-oxidation in E. coli occurs by an energy-dependent, 
protein -mediated process. For long-chain fatty acids destined 
for /S-oxidation, this process minimally requires the products 
of the fadL and fadD genes. The fadL gene encodes an outer 
membrane-bound protein (FadL) that binds exogenous long- 
chain fatty acids with a relatively high affinity and by some 
unknown mechanism transfers these compounds across the 
outer membrane (1-4). The fadD gene encodes the inner 
membrane-associated acyl coenzyme A synthetase (acyl -CoA 
synthetase (fatty acidxoenzyme A ligase (AMP-forming), EC 
6.2.1.3)) (5). This enzyme catalyzes the esterification of fatty 
acids into metabolically active CoA thioesters concomitant 
with transport. The mechanisms that govern the transfer of 
long-chain fatty acids from FadL across the periplasmic space 
and the inner membrane to the acyl-CoA synthetase remain 
largely undefined. There is some evidence for an oleic acid 
binding protein in the inner membrane that has been postu- 
lated to be a H + /k)ng-chain fatty acid co-transporter (6, 7). 

Acyl-CoA synthetases catalyze the formation of fatty acyl- 
CoA by a two-step mechanism that proceeds through the 
pyrophosphorolysis of ATP (8). 

Mg a \ 

fatty acid + ATP < * (fatty acid - AMPJ + PP, 

Structure I 

[fatty acid - AMP] + CoASH -> fatty acid - SCoA + AMP 

Structure II 

E, coli contains a single acyl-CoA synthetase which has been 
purified to homogeneity (9, 10). This enzyme has broad chain- 
length specificity giving V mM values ranging from 2632 nmoi/ 
min/mg of protein for lauric acid (Ci 2 ) to 135 nmol/min/mg 
of protein for hexanoate (C 6 ) (9). Maximal activities associ- 
ated with this enzyme are found with fatty acids ranging in 
length between C« and C^i (9). Overath and colleagues (11) 
proposed that acyl-CoA synthetase of E. coli was required for 
long-chain fatty acid transport and coined the term vectorial 
acylation to describe the role of this enzyme. Although the 
precise role of this enzyme in fatty acid transport is not well 
defined, it is clear that it plays a pivotal role in this process 
by catalyzing the formation of metabolically active CoA 
thioesters for subsequent degradation or incorporation into 
phospholipids (5, 9, 11). 

The structural gene for acyl-CoA synthetase (fadD) was 
identified by Overath et ol (5) who mapped this locus to the 
40 -min region of the E. coli chromosome. In this pioneering 
work, this enzyme was proposed to be partially membrane- 
associated and was shown to activate both mono- and poly- 
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unsaturated fatty acids. Acyl-CoA synthetase from E. coli has 
been estimated to have a native molecular weight of 120,000 
based on elution profiles of the purified enzyme on a G-200 
column (5). Kameda and Nunn (9) estimated a monomeric 
molecular weight of 47,000 and proposed that the enzyme is 
a dimer. Acyl-CoA synthetase activity is induced by oleate, 
but to a lower relative level, when compared to the levels of 
induction documented for three other enzymes required for 
long-chain fatty acid degradation (/?-hydroxyacyl-CoA dehy- 
drogenase, enoyl-CoA hydratase, and 0-ketothiolase) (5). The 
fadD gene, like the fadBA, fadE, and fadL genes, is part of the 
fatty acid degradative regulon under the control of the tran- 
scriptional regulator FadR (11-13). 

The present work describes the cloning, sequencing, and 
expression of the acyl-CoA synthetase structural gene (fadD) 
of E. coli. This work stems from our goal to define the 
underlying biochemical mechanisms that govern long-chain 
fatty acid transport in enteric bacteria prior to metabolic 
utilization. 

EXPERIMENTAL PROCEDURES 

Bacterial Strains—The E. coli strain JM103 [Mlac pro) thi strA 
endA sbcB hsdRW traD36 proAB loci* ZAM15)] was used for the 
propagation of M13 derivatives. For routine plasmid propagation and 
the generation of the fadD86 strain PN235, C600 (F-thi-1 leuB6 lac Yl 
tonA21 supE44) was used. Strains RS3010 (fadR) and LS6928 
(fadD88 zea::Tnl0 fadR) have been described elsewhere (14, 15). The 
fadD68 strain PN235 was generated by Pi transduction of strain 
C600 using a phage stock grown on strain LS6928. Bacterial cultures 
were grown at 37 "C in a Lab Line gyratory shaker in 2YT (16), Luria 
broth (LB; 16), or Tryptone broth (TB; 17). When minimal medium 
was required, medium E supplemented with vitamin Bi (17) was used. 
Carbon sources, sterilized separately, were added to final concentra- 
tions of 25 mM glucose, 25 dim potassium acetate, 5 mM decanoate, 
or 5 mM oleate. As required, amino acids were added to a final 
concentration of 0.01%. When required to maintain plasmids, anti- 
biotics were added to 100 pg/ml ampicillin, 40 fig/m\ kanamycin, 10 
/ig/m) tetracycline, and 40 $ig/m\ chloramphenicol. Growth of bacte- 
rial cultures was routinely monitored using a Klett-Summerson col- 
orimeter equipped with a blue filter. 

Identification of the fadD Gene in the Kohara Gene Library — The 
complete miniset of X clones of the Kohara library (18) were gra- 
ciously provided by Dr. Y. Kohara (DNA Research Center, National 
Institute of Genetics, Mishiraa 411, Japan). Eleven clones represent- 
ing the 40-min region of the E. coli chromosome (5E12, 4B8, 12H7, 
3E12, 9F2, 7F2, 6D1, 12B3, 15D5, 19H3, and 12C7) were propagated 
on the bacterial strain NM621 as previously described (19). Lysates 
were used to infect the XCI857 lysogen derived from strain PN235 at 
a multiplicity of infection of 1 as described by Miller (17). Following 
absorption, 1 ml of LB was added and the cells were allowed to 
recover for 30 min at 30 "C. Following recovery, the cells were pelleted 
by centrifugation, resuspended in the original volume of Medium E, 
plated on oleate minimal agar plates, and incubated at 30 *C for 72 
h. At 72 h, colonies that were able to grow on oleate as a sole carbon 
and energy source were identified in cells infected with phage DNA 
from clones 7F2 and 6D1. These two clones were confirmed to restore 
the ability of the fadD strain PN235 to grow on oleate (Ole*) following 
a second round of lysogenic complementation. Clones 7F2 and 6Dl 
from the Kohara library were propagated in strain NM621 for DNA 
isolation on TB agarose plates (24). Plaques giving nearly confluent 
lysis were visible 12-14 h later at which time the plates were flooded 
with 3 ml of 50 mM Tris-HCl, pH 7.5, 100 mM NaCl, 10 mM MgSO«, 
0.01% gelatin (SM). X-DNA was isolated using the hexadecyltrime- 
thylammonium bromide (CTAB) method (20). Briefly, 20 ml of each 
phage stock (4 x 10 10 plaque-forming units/ml) was incubated with 
DNase I at 20 pg/ml for 5 min and then clarified by centrifugation 
(10,000 X g for 20 min). The supernatants were transferred to a new 
tube, and 10 ml of DEAE-cellulose slurry (80% in SM) was added 
and incubated at room temperature for 30 min in an angled rotator. 
The DEAE-cellulose was pelleted by centrifugation, the supernatants 
were transferred to a new tube, and EDTA was added to 20 mM and 
Tris-HCl, pH 8.0, was added to 100 mM. Proteinase K was added to 
the mixture to a final concentration of 50 Mg/ml which was then 
heated at 45 'C for 15 min. Hexadecyltrimethylammonium bromide 



was added to a final concentration of 0.5% and incubated at 68 *C 
for 4 min. The samples were cooled on ice and centrifuged 10,000 x 
g tor 30 min. The DNA pellets were dissolved in 6 ml of 1.2 M NaCl 
and DNA precipitated by the addition of 2 volumes of absolute 
ethanol. The final DNA samples were resuspended in 200 jil of 10 
mM Tris-HCl, pH 7.6, 5 mM EDTA (TE), analyzed by agarose gel 
electrophoresis, and used in subcloning experiments as described 
below. 

Cloning and Sequencing — DNA from clone 6D1 was restricted with 
Clal or Hindlll, ligated into the plasmid vector pACYCl77 (21), and 
transformed into the fadD8S strain PN235. Restriction, ligation, 
plasmid isolation, and transformation procedures have been described 
previously (22, 23). Ole* transformanta were identified in both sets 
of ligation mixtures. Analysis of restriction patterns generated re- 
vealed that restriction fragments from the Ole* transformants from 
the Clal ligation mixture and the Hindlll ligation mixture overlapped. 
As the insert from the Clal digest was shown to be smaller (3.4 kb), 1 
this plasmid, designated pN300, was used for all further study. Re- 
striction fragments from pN300 were subcloned into either 
PACYC177 or pACYCl84 (21) as described under "Results," yielding 
the plasmids and complementation patterns illustrated in Fig. IB. 
The sequencing strategy of the Clal insert from pN300 is illustrated 
in Fig. 1C. The series of M13 clones were sequenced using either the 
fccZ-specific upstream primer (5'-GTTTTCCCAGTCACGAC-3'), 
the M13 universal primer (5'-GTAAAACGACGGCCAGT-3'), or 
with fadD'specific oligonucleotides by the dideoxy chain-terminating 
method of Sanger et ai (24) using Sequenase (v 2.0; U. S. Biochemi- 
cals). As shown in Fig. 1C, pN300 was sequenced across the Sail and 
Hindlll restriction sites using /a^D-specific oligonucleotides to ensure 
proper alignment between these three fragments of DNA. Sequencing 
reactions were resolved on a standard 8% polyacrytamide gel (3). All 
oligonucleotides used in this study were synthesized on a Pharmacia 
LKB Biotechnology Inc. Gene Assembler Plus. 

Analysis of Acyl-CoA Synthetase Activity— Bacteria (wild-type and 
fadD strains containing the collection of fadD* and fadD clones) were 
grown to midlog phase (6 x 10 s cells/ml) in TB or TB supplemented 
with 5 mM oleate and 0.5% Brij 58 (TBO) and with antibiotics as 
required. Cells were harvested by centrifugation, washed twice with 
Medium E, resuspended to a density of 1.2 X 10* cells/ml in 10 mM 
Tris-HCl, pH 7.5, and lysed by three cycles of sonication at 0 *C. 
Acyl-CoA synthetase activities were determined in sonicated cell 
extracts as described by Kameda and Nunn (9). The reaction mixtures 
contained 200 mM Tris-HCl, pH 7.5, 2.5 mM ATP, 8 mM MgCl 2 , 2 
mM EDTA, 20 mM NaF, 0.1% Triton X-100, 10 /*m ['HJoleate, 0.5 
mM coenzyme A, and ceil extract in a total volume of 0.5 ml. The 
reactions were initiated with the addition of coenzyme A, incubated 
at 35 "C for 10 min, and terminated by the addition of 2.5 ml of 
isopropyl alcohol:n-heptane:l M H2SO4 (40:10:1). The radioactive 
oleic acid was removed by organic extraction using a- heptane (9). 
Oleoyl-CoA formed during the reaction remained in the aqueous 
fraction and was quantified by scintillation counting. Protein concen- 
trations in the enzyme extracts were determined using the Bradford 
assay and bovine serum albumin as a standard (25). The values 
presented represent the average from at least three independent 
experiments. 

Overexpression of Acyl-CoA Synthetase— The 3.4-kb Cial fragment 
(fadD*) from pN300 was gel-purified and ligated with a Clal to 
BamHl linker. Linkers were purified and phosphorylated using bac- 
terial alkaline phosphatase prior to restriction (22). Following ligation 
of the linkers, the fragment was restricted with BamHl, repurified, 
and ligated into the BamHl site of the expression plasmid pCD130. 
pCDl30 is derived from pT7-5 and contains the fadR gene in the 
orientation opposite to the T7 promoter to maintain stability of fadD* 
(26). Both orientations of the fadD gene were obtained yielding 
plasmid pN321 and pN324. Plasmid pN324 contained fadD* under 
the T7 promoter while pN321 had fadD* in the orientation opposite 
to the T7 promoter. The plasmids pN321 and pN324 were trans- 
formed into strain BL21 (DE3)(plysS) and expressed following in- 
duction with isopropyl-l-thio-/?-D-galactopyranoside (27). Following 
induction and labeling with ["S] methionine, cells were harvested, 
resuspended in SDS sample buffer, boiled, and resolved on a 12% 
SDS-polyacrylamide gel using the Laemmli buffer system (28). Fol- 
lowing electrophoresis, the gels were dried and subjected to autora- 
diography for 2-24 h. 

Partial Purification of Acyl-CoA Synthetase and N -terminal Amino 
Acid Sequencing— Acyl-CoA synthetase was partially purified from a 



1 The abbreviation used is: kb, kilobase pair(s). 
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500-ml culture of strain BL21(pLysS) harboring the fadD" expression 
plasmid pN324 after induction. Following induction with isopropyl 
/?-D-thiogalactopyranoside, cultures were grown for an additional 2 h 
and cells were harvested by cent rifugat ion. The cell pellets were 
washed twice in minimal medium E, resuspended in 50 mM potassium 
phosphate, pH 8.0, and disrupted by three cycles of sonication at 
0 *C. The sonicated ext ract was clarified by centrifugation (12,000 x 
g for 15 min). The supernatant was centrifuged at 60,000 x g for 2 h, 
and the membrane pellet was discarded. Acyl-CoA synthetase was 
partially purified from the supernatant by batch DEAE-cellulose 
chromatography and by ammonium sulfate fractionation using the 
conditions described by Overath ct al (5) and Kameda ct al (10). The 
final protein sample was subjected to preparative electrophoresis on 
a 12% SDS-polyacrylamide gel and electrophoretically transferred to 
a prewetted polyvinylidene di fluoride (Immobilon Transfer, What- 
man) in 10 mM 3-(cyclohexylamino)-l-propanesulfonic acid, pH 11.0, 
10% methanol (29). Following eiectrophoretic transfer, the position 
of the acyl-CoA synthetase was identified by staining the polyvinyli- 
dene difluoride membrane briefly with Ponceau red. The strip con- 
taining acyl-CoA synthetase was excised, extensively washed with 
high performance liquid chromatography grade water, dried at room 
temperature, and stored at -70 *C. The N-terminal amino acid se- 
quence from this sample was determined using an Applied Bio- 
Systems 470A gas phase protein sequenator equipped with an Applied 
BioSystems Model 120A in-line detector for phenylthiohydantoin- 
derived amino acids from each cycle of Edman degradation at the 
Harvard University MicroChcmistry Facility. 

Mapping the Transcription initiation Site of the fadD Gene— The 
transcriptional start of the fadD gene was identified by primer exten- 
sion of two /flrfD-specific oligonucleotides, 5'-GATAACGGTTA- 
AGCCAAACC-3' and 5 / -CTACCAGAGATTGATAACGG-3' (cor- 
responding to nucleotides 315-334 and 366-385, respectively; see Fig. 
4), hybridized to in uu/o-synthesi?.ed mRNA by reverse transcriptase 
(3). Total in uruo-synthesized RNA was isolated from a 100- ml midiog 
culture of strain RS3010 (fadR) or Kl2 grown in TB (3). The fadD- 
specific oligonucleotides for primer extension were 5' end-labeled 
using [7- 32 Pl ATP and polynucleotide kinase as described by Maniatis 
et al (22). 

Identification of the FadR Binding Site by DNase I Footprinting — 
The 353-base pair SauZA fragment containing the fadD promoter was 
gel-purified and ligated into the BamHl site of Ml3mpl9, Ml3mpl8, 
and pUCl8 thereby generating clones MD21 (to sequence the top 
strand of the fadD promoter-containing fragment), MD20 (to se- 
quence the bottom strand of the fadD promoter-containing fragment), 
and pN330, respectively. Plasmid pN330 was purified and used as a 
source of DNA for gel shifts and DNase I footprinting assays. Protein- 
DNA gel retention assays (gel shifts) and DNase I footprinting were 
carried out essentially as described by DiRusso et al (13) using 
purified FadR. The concentrations of FadR used in these experiments 
ore given in the appropriate figure legends. For gel shifts, the 414- 
base pair EcoRI-HmdIII fragment from pN330 was gel-purified and 
labeled with |or« 32 P]dATP using the Klenow fragment of DNA polym- 
erase (22). The binding of FadR to an |a- 32 PldATP-labeled fragment 
containing the fadB operator was used as an internal control as these 
parameters were previously well defined (13). FadR binding was 
estimated as the conversion of the fast mobility complex (unbound 
DNA) to the slow mobility complex (FadR-bound). Quantitation was 
carried out with a Bio I mage computer-assisted analysis system 
(MilliGen/Biosearch). For DNase I footprinting, pN330 was re- 
stricted with £coR! (top strand) or HindUl (bottom strand) and 
PvuW (cleaves only within the vector), and the promoter-containing 
fragments were gel-purified. Following purification, the WmdW- 
PuuU and EcoHl-PvuU fragments were 5' end-labeled with (y-^P] 
ATP using polynucleotide kinase and restricted with EcoRI and 
A/mdlll, respectively. The appropriate M P- labeled £coRI-//tndIII (top 
strand of the fadD promoter labeled) or WmdW-EcoRI (bottom strand 
of the fadD promoter labeled) fragments were gel -purified and used 
for DNase I footprinting using the conditions described by DiRusso 
et al (13). The concentrations of FadR used in these experiments are 
given in the appropriate figure legends. Clones MD21 and MD20 were 
sequenced using the EcoR I -specific primer and the WmdIII-specific 
primer, respectively, and sequencing reactions were run adjacent to 
the DNase I footprint reactions on a 6% standard sequencing gel to 
accurately position the FadR binding sitc(s) within the fadD promoter 
(17). 

The analyses of the DNA sequence of the fadD gene and the amino 
acid sequences of acyl-CoA synthetase, yeast acyl-CoA synthetase, 
rnt acyl-CoA synthetase, and firefly luciferase were done using the 



Wisconsin Genetics Computer Group programs (35) and DNA In- 
spector II (TextCo Inc., West Lebanon, NH). 

Materials— Reagents and enzymes used for sequencing, transcrip- 
tion mapping, and restriction were purchased from U. S. Biochemi- 
cals, Bethesda Research Laboratories, and New England Biolabs. 
Reagents used for oligonucleotide synthesis were purchased from 
ABN/Biogenex and Pharmacia. (o-^SjdATP, |o- tt P)dATP. |y- M Pl 
ATP, | M Slmethionine, and [ 3 H|oleate were obtained from Du Pont- 
New England Nuclear. Antibiotics and other supplements for bacte- 
rial growth were purchased from Difco and Sigma. All other chemicals 
were obtained from standard suppliers and were of reagent grade. 

RESULTS 

Cloning the fadD Gene— The fadD gene was mapped by 
Overath et al (5) to the 40-min region of the E. coli chromo- 
some (5). Phage from 11 clones (5E12, 4B8, 12H7. 3E12, 9F2, 
7F2, 6D1, 12B3, 15D5, 19H3, and 20H4) of the X gene library 
generated by Kohara et al (18) were transduced into the 
XCI857 lysogen of the fadD strain PN235 (Fig. IA) for com- 
plementation analysis. DNA from clones 7F2 and 6D1 was 
able to complement the fadD88 defect. A-DNA was purified 
from clone 6D1 and restricted with C/oI, and fragments were 
ligated into Cla I -restricted pACYCl77. Plasmid DNA from 
one of the transformants that complemented the fadD8S 
mutation (acquired the ability to grow on the long-chain fatty 
acid oleate (Ole*)) was purified and designated pN300 (Fig. 
IB). This plasmid contained a 3.4-kb Cla\ fragment of ge- 
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Fig. 1. Cloning and sequencing the fadD* gene of E. coli. A, 
the top line represents the E. coli chromosome with the numbers 
representing the position in kilobases. The genetic positions of pabB 
ond tar are given as reference points. The small lines represent the 
chromosomal DNA in the different isolates (numbered according to 
serial number) of the Kohara gene library (23). Clones 7F2 and 6D1 
ore starred to indicate that they were able to complement the fadD88 
defect in strain PN235. B, restriction mop of pN300 and fadD and 
fadD" subclones. The complementation patterns are shown at the 
right, + refers to growth on 5 mM oleate, and - refers to no growth 
on 5 mM oleate. C, sequencing strategy of the fadD* gene. Arrows 
represent M13 subclones of the fadD gene including those isolates 
generated using ExolU. These clones were sequenced using either the 
universal primer or the lacZ -specific primer. Arrows preceded by on 
asterisk (*) indicate the direction and extent of sequencing using 
either M13-derived clones or pNl30 sequenced using /odO-specific 
oligonucleotides. The shaded region (iVcoI-CTal) represents the se- 
quence presented in Fig. 4. 
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nomic DNA. A series of subclones of pN300 were constructed 
to delineate the end points of the fadD gene for further studies 
using DNA sequencing (Fig. \B). One of these subclones, 
pN308, contained a 2.7-kb HindUl-Clal fragment which com- 
plemented fadD88. When a small (500-base pair) TVcoI frag- 
ment was removed from pN308 to generate pN309, the small- 
est subclone complementing the fadD88 defect was generated. 

Acyl-CoA Synthetase Expression— Acy I -Co A synthetase ac- 
tivities were monitored in the wild-type strain K12, the fadR 
strain RS3010, and the fadR fadD strain LS6928 harboring 
the fadD* and fadD plasmHs shown in Fig. IB. As shown in 
Fig. 2i4, acyl-CoA synthetase activity was inducible 2- fold in 
the presence of the long-chain fatty acid oleate in the proto- 
trophic strain K12. Acyl-CoA synthetase activities in the fadR 
strain RS3010 grown under both conditions were comparable 
to the levels found in E. coli K-12 grown in the presence of 
oleate. As expected, no acyl-CoA synthetase activity was 
observed in the fadD fadR strain LS6928. The fadR fadD 
strain LS6928 harboring the fadD and fadD* plasmids pN300\ 
PNI04, pN305, pN306, pN307, pN308, or pN309 had acyl- 
CoA synthetase activities that reflected their complementa- 
tion patterns (Fig. 2/3). 

Identification of the fadD Gene Product and N-terminal 
Amino Acid Sequence Analysis of Acyl-CoA Synthetase — The 
3.4 -kb Cla\ fragment from pN300, containing the entire fadD 
gene, was isolated, ligated with a Clal to BamHl linker, and 
cloned into the T7 expression plasmid pCDl30 (13) to yield 
the plasmids pN321 and pN324 (Fig. 3A). A protein with an 
M T of 62,000 was identified by SDS-polyacrylamide gel elec- 
trophoresis in extracts of cells harboring pN324 following 
induction which was presumed to be acy 1 -Co A synthetase 
(Fig. 3B). A second protein that was poorly produced relative 
to acyl-CoA synthetase with an M r of 22,000 was also identi- 
fied in these extracts which, based on the sequence data 
described below, was presumed to be distinct from the fadD 
gene. Neither protein was produced in cells harboring pN321 
(fadD* in the reverse orientation to the T7 promoter) or 
pCD 130 (plasmid vector)(data not shown). A polypeptide with 
an M t of 15,000 was produced from both constructs as well as 
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Fin. 2. Acyl-CoA synthetase activities. A, induction patterns 
of acyl-CoA synthetase activities in the prototrophic strain K-12, the 
fadR strain R$3010, and the fadR fadD strain LS6928 following 
growth in TB or TB containing 5 mM oleate. £, acyl-CoA synthetase 
activities in the fadD fadR strain LS6928 harboring the collection of 
fadD* and fadD clones illustrated in Fig. IB following growth in TB 
containing 5 mM oleate. The scale on the left indicates acyl-CoA 
synthetase activity expressed in picomoles of o!eoyl-CoA formed/ 
min/mg of protein. The error bars indicate the standard error of the 
mean of three independent experiments. 
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Fig. 3. Ovcrexpression of acyl-CoA synthetase. A, plasmid 
constructions harboring the fadD* gene in the (+) orientation 
(pN324) and the (-) orientation (pN321) relative to the T7 promoter 
(#io) in plasmid pCDl30. B, autoradiograph of acyl-CoA synthetase 
expression following induction with isopropyl-l-thio-0-D-galactopy- 
ranoside, labeling with [^Slmethionine, and SDS-polyacrylamide gel 
electrophoresis; lane I, pN324; lane 2, pN32l; molecular weights 
(xi0~ 3 ) are indicated to the left. 



the plasmid vector and thus presumed to be vector-specific. 
Use of this expression system defined the direction of tran- 
scription of the fadD gene from the Nco\ site toward the Clal 
site as shown in Fig. 1C. This directional alignment of tran- 
scription was confirmed by the DNA sequence of the fadD 
gene. 

The M t = 62,000 polypeptide assumed to be acyl-CoA 
synthetase was partially purified from BL21(plysS)(pN324) 
following induction as described under "Experimental Pro- 
cedures" and subjected to automated N-terminal amino acid 
sequence analysis as described under "Experimental Proce- 
dures." The N-terminal amino acid sequence was shown to be 
Met-Lys-Lys-Val-Trp-Leu-Asn-Arg-Tyr-Pro. 

Sequence of the fadD Gene— The entire 3.4 -kb Clal frag- 
ment from pN300 was sequenced as shown in Fig. 1C. The 
fadD gene was shown by complementation to be localized on 
a 2.2-kb Ncol-Clal fragment. Sequence analysis of this frag- 
ment of DNA as shown in Fig. 4 (2230 base pairs) revealed a 
single open reading frame beginning with ATG at nucleotide 
241 encoding a polypeptide consisting of 580 amino acid 
residues with a molecular weight of 64,406. This open reading 
frame did not encode the N-terminal amino acid sequence 
defined from the purified acyl-CoA synthetase (see above). 
Furthermore, this ATG was five nucleotides upstream from 
the adenine residue defined as the transcriptional initiation 
site by primer extension (see below). Careful analysis of the 
reading frame revealed that the amino acid sequence Leu- 
Lys-Lys-Val-Trp-Leu-Asn-Arg-Tyr-Pro, starting at nucleo- 
tide 307, had 9 of the 10 amino acid residues defined for the 
purified protein. The notable difference was in the N-terminal 
amino acid. The DNA sequence predicted a leucine (TTG) 
while the protein sequence defined a methionine (ATG). We 
propose that the TTG beginning at nucleotide 307 represented 
the initiation codon and encodes a methionine residue (thus 
giving the predicted N-terminal amino acid sequence Met- 
Lys-Lys-Val-Trp-Leu-Asn-Arg-Tyr-Pro). This proposal was 
based on three lines of evidence. First, the data obtained from 
the N-terminal amino acid sequence of the acyl-CoA synthe- 
tase clearly indicated the presence of a methionine at this 
position. Second, the protein sequence beginning at nucleotide 
240 did not result in an amino acid sequence that would 
indicate that this protein was post-translationally modified 
(Lc. cleavage of a signal peptide and/or modification of the 
N-terminal amino acid). Third, UUG can act as an alternative 
initiation codon (30). Assuming that the TTG at nucleotide 
307 encodes the initiation methionine, the predicted size of 
the acyl-CoA synthetase from our expression of pN324 
(62,000) was in close agreement with that predicted from the 
DNA sequence (62,028). The coding sequence for the M f = 
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61 aatcccwjcccccccagactacaaacacttctaactcaataattccttctttttaaaga^ 120 
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J 2 1 AAAOAAACAGC (X>CTGGTVCGCTGT7TC rGi2A TTCTT ACCGT AAAC AT AAAAAT AAA TAfl 1 6 0 

•35 . 0 D j. 10 . 

1 9 1 TCACCCOCTTCGCAACC ITT T CO TI COOTAATTATCA AGCrOgTA TCA TOA GT7AATATT 2 4 0 

241 ATGTTAACCWATCTATATCATTTOCGC^^ 300 

J01 GAAG A ATTG A AO AACOTTTGOCTT A ACCGTT ATC CCG COG A COTTCCO A CQOAGATC AA C 360 
KfaVMt«yiYtITroLt^fttnAinTYlPrg^^*BpV I ProThrGlu 1 1 oAan 

361 CCTO^CCCTrATCAATCTCII^ACATATCTTTCACCAGTCGGTCCCCCCGCTACGCCCA 4 20 
ProA«pArgTyiOln5«rL<iuVttlAcpH9tPh«OluClnS0i VnlCLyAltLauAigArg 

4 21 TC AA C CTGCOTTTCTG A A T ATCfiGGC A GGTAATGACCTTCGC A AGCTCC A AG AACG C AC T 4 80 
SiirTh i CyoVfl 1 CyaG! uTyrGl yGl yC 1 yAanAapLauAi gLysLo\iGluGluAi oSor 

481 CGCQCOTTTUCCCCTT ATTT CCA AC A ACCCTTCGGCCTC A AG A A ACCCC ATCCCGTTGCG *40 
Ai gAlnPhoAloAlnTyt UouGl nClnGlyl.auClyl.oul.yal.yaClyAapAigVal Ala 

■ ■ ■ 4 ■ + 

Ml I'TOATCATGCCTAATTTATTCCAATATCCCCTGCCGCTGTTTGCCATTTTCCCTGCCCGC 600 
LauHoLMflt V»i OAont.ouI-nuGl nTyrProVnl AlaLouPhoGly 1 1 olouAroAl afll y 

6 0 1 ATCATCGTCGTAAACCTTAACC<XrrTGTATACCCCCCOTCACCTTCACCATCACCTTAAC 660 
MoU InynlVAlAsnVnlAsnProt.ouTyrThrPrQArgGluLouGluHisOlnLouAsn 

661 C AT AC COG CCC AT CCCCG ATTGT TAT CGTGT C T A ACTTTGCTC A C AC A CTGC A AAA AGTO "J 20 
AopSorClyAlaSorAla! loVall loValSarAtnPttoAIaHiaThrLouCluLyaVal 

I • ■ ■ * 

721 CTTGATAAAACCGCCGTTCACCACCTAATTCTGACCCCTATCCCCCATCACCTATCTACG 7 80 
VnlAapUvsThi AlBVnlGlnHi»VninoLouThrArgXotGlyABpninLou3«rTh r 

****** 

781 OCA^AAOnCACOOTAaTGAATTTCGTTGTTAAATACATCAAGCGTTTGGTGCCGAAATAC 840 
AlrtLysClyThrVnlVAlAanPhAVAlValljyaTyr I lot.y«AroL«uVnlProLv*Tyr 

84 1 CATCTCCCAGATGCCATTTCATTTCGTAGCGCACTGCATAACGGCTACCGGATGCAGTAC 900 
Ht«l,ouP*oA«pAlo: l«SorPh»Ai-ySofAlrtL<»uHi»AiinGiyTyrArgM»tGlnTyf 

901 (JTCAAACCCGAACTGGTGCCOnAAGATTTAOCTTTTCTGCAATACACCGCCCOCACCACT 960 
Vo \ by • ProG luLouVo 1 p roG 1 u A fl p l,ouA 1 a Pho UouG 1 nTy rTft rO 1 yG 1 yTh rTh v 

96 1 CCTGTGGaJAAAGCCCCGATGCT-CACrCACCGCAATATCCTGGCCAACCTCCAACAGGTT 1020 
GlyVAl Al«T.yttGlvAUMoiL,«uThrHi*Ai OAftnMOt LOuAlflAnnLouGluGlnVal 

1021 AA C G CG A C CT ATCGTC CGCTCTTCC A TCC GGGC A A A G ACCTGGTCCTG AC GGCCCTG CCG 1080 
AenAlaThtTy tGlyPiot.«uUiuHi8ProGlyLyiCluL«uValVdlThrAl«LouPro 
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1081 CTGTATCA C ATTTTTGC CCTC AC C ATT A ACTOCCTCCTCTTT AT CCA ACTGCCT GGGC A G 114 0 
UouTyt HioIlol hoAl alaiuThrl loAanCyBLouLouPhot laOluLouGiyOlyGln 
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114 1 AACCTGCTTATCACTA ACCCCCGCG AT ATTCC AGGGTTGGT A A A AC ACTT AG CGA A ATAT 1200 
A»nLoul,fluUoThrA9nProAi:oA«pHftProGlvt.ouValLyiiGltJtouAlaLyaTvr 

1201 CCOTTT A CCCCT ATC ACCGCCGTT AAC A CCTTCTT C AATCC GTTGCTG A AC A AT AAA G AC 1260 
P roPh oTh r A La 1 1 «iTh r 0 1 y V a 1 AanThr (.ouPhoAanAl ALiOuLouAanAanLyaG 1 u 
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1261 TT CC AGC AC CTGO ATTT CTCCACTCTOC ATCTTTCCG C AGO CGCTCGG ATC C C AGTGC AG 1320 
PhoGLnGlnL.«uAapPh«SornarLQu>UaUauSoi-AlAGly01yGlyMatProValGln 
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1321 CAAGTGGTCOCAGACCGTrCGGTCAAACTGACCGGACAGTATCTCCTCGAAGCCTATGOC U 80 
ainValv<ilAlnGluAroTrpVflll,yol.<»uThrOlyGlnTyrL«uUouGluGlyTyrCiy 
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1381 CTT A CCC AGTCTC CG CCCCTCGT CAG CCTT AAC CC AT ATCA T ATTC ATT ATC AT AGTtXJT 14 4 0 
l.ouThtCluCyoAlaProtouVAldorValAflnProTyi AopnaAupTyrKiflSucGly 

• ♦ » • • « 

1441 AGC ATCGCTTTGC CCCTC CCCT C GA CGG A AGCC AAACT GCTGC ATG ATQ ATCAT AATC AA 1500 
Sari loC)yL»uProVolProS<>rThrOluAiflLyHl,QuV<jlAapA8p Asp AapAonGlu 

1501 OTACCACCACKn , CAACCXK«3TCAC<T1TCTCTCAAAC<5ACCOCACCTCATCCTCCGT^ 1560 
ValProProClyClnPioGlyCluLouCyitVrtlLy«ClyProGLnV«lMotl.ouGlyTy» 

1 S6 1 TGCCAGCGTCCCOATGCrACCGATCAAATCATCAAAAATCGCTGGTTACACACCGGCOAC 1620 
TrpGlnArgProAspAl<iThrAirpGJuI lot lolyaAftnClyTrpMmHisThrClyAap 

« • . . 

1621 ATCCCGGT A ATOG ATGA AG A A GG ATTCCTGCGC ATTCTCGATCGT AAA AAAC AC ATC ATT 1680 
1 1 oA 1 aValMot. AapGluGluGlyPhoLouAt'o I loVal AapArgLynL-yaAapMnt I lo 

1681 CTGGTTTCCCGTTTTAACGTCTATCCAACOAQATTCAAGATCTCCTCATCCAGCATGCTG 17 4 0 
tAtuValSorCLyPhoAanValTyrProThrArgLouLyaMaLSorSorCyaSorKoiVal 

* • * * • 

174 1 GCCT AC AGC AAGTCCCGGCTCTTCGC GT ACCTTCCTGGCTCC AGTGCTG A A GCCGTC A A A 1800 
AtaTyr ArglvysSm Arot-ouLouAlATyrLAUProGlySorSorOl yOluOlyVAl l.ya 

1801 ATCTTCGTACTIXJAAAAAACATCCATCCCTTACTCAACACTCACTCGTGACTT7TTGCCCC 1860 
IloPhoValVnl Lyul.yiAapPtoflnr l.ouThi CI uC 1 uS-j r LouVa I Thr PhoCya A i a 

» • a ♦ « a 

1 86 1 CGTC AGCTC A COOG AT AC A AA CT AC CGA At 5CTOOTCG AGTTTCGTG ATG ACT T A C CG AA A 1 920 
Ai oGlntouThrGlyTyi LyaVAlPi-oLyaLauVAlGluPhoArgAapClutoMPtoLya 

» « 

1921 TCTAACGTCGGAAAAATTTTGCGACGAGAATTACOTGACGAADCGCOGCAAAGTGGACAA 1 98 0 
Sur AsnVAlClyUyal loLouAfoAroGluLouArgAapGluAl aArgGlnSorGlyOln 

• ♦ « • • • 

1981 TAAAOCCTGAGCOTTAAGTCAOTCGTCAC ACGCCGCTT A ATCCQGCGTTTTTTTT CACCC 2040 
End 

204 1 CCACTAAACAOAAAACAATTTCAATTACCAAATCATTACCACCGACGATGCGCTOCCTTC 2100 

• • • • > . 

2101 TTTGTOTG AACCCGTCCOTOCCTTTCCOGCC AT AOCCCTCGATACTC AATTTCTTCCTAC 2160 
2161 CCCCACrTATTACCCCCAGCCXGCGTTCATTCAACTTTTCGATTXXrGACCATCTGCCGCT 2220 
222 L AATCGCATCC 2230 

Fig. 4. Nucleotide sequence of the fadD* gene and the de- 
duced amino acid sequence for acyl-CoA synthetase. +/ refers 
to the transcriptional start defined by primer extension, potential 
-10 and -35 regions are noted, Odi and Om are the fadD* operators 
that bind FadR that are underlined and italicized, and the potential 
ribosome binding site sequence is noted with a double underline. The 
underlined amino acid sequence represents the N-terminal amino 
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Fig. 5. Primer extension of the 5' end-labeled fadD*-spe- 
cific oligonucleotide 5 -GATAACGGTTAAGCCAAACC-3'. 

Lanes G, A, T, and C represent the fadD sequence using the same 
oligonucleotide as a primer; lane 1 represents the primer extended 
fragment using RNA (20 jig) from the fadR strain RS3010, while lane 
2 represents the primer extended fragment using RNA (20 Mg) from 
strain K-12. The sequence to the right indicates the transcriptional 
start at the adenine residue (noted by the shaded arrow and designated 
+ 7). 

22,000 protein identified using the T7 system was shown to 
be contained on the HmdM-yVcoI restriction fragment of 
pN300 (upstream to fadD*; data not shown). This reading 
frame was followed by a sequence that resembled a p- inde- 
pendent terminator. Subcloning experiments demonstrated 
that elimination of this fragment of DNA did not affect 
complementation (both by growth on oleate and acyl-CoA 
synthetase levels) of the fadD88 mutation, and thus this 
polypeptide will not be considered further in this work. 

Primer extension using the the 5' end-labeled oligonucleo- 
tide 5'-GATAACGGTTAAGCCAAACC-3' defined the start 
of transcription of the fadD gene at the adenine residue at 
nucleotide position 246 (Fig. 5). The alignment was confirmed 
by primer extension of a second fadD-specific oligonucleotide 
(5 ' -CTACCAG AG ATTG AT AACGG -3' )(data not shown). By 
convention, this residue has been designated +1 (Fig. 4). 
Potential -10 and -35 regions were noted around nucleotide 
positions 237 and 212, respectively, that were in reasonable 
agreement with those defined for <r-70 responsive promoters 
(31). As expected, the RNA isolated from the fadR strain 
RS3010 resulted in a higher signal following primer extension 
when compared to RNA isolated from the prototrophic strain 
K12 (Fig. 5). Following the translational stop of the fadD 
gene, a nearly perfect GC-rich inverted repeat is present 
followed by a series of 8 thymidine residues (nucleotides 2010 
to 2035) which may act as a p-independent transcriptional 
terminator (Fig. 4). 

fdentification of the FadR Binding Sites, O f >\ and Om — In 
the region just upstream from the transcriptional start site, a 
sequence was identified that shared homology to the operator 
region of the fadB gene defined as the FadR binding site ( 13). 
This region, from -13 to -29 (5' -AGCTGGTATGA TGAGTT- 
3'), was identical in 12 of the 17 base pairs {italicized nucle- 
otides noted above) with the sequence defined as the FadR 
operator of the fadB gene using DNase 1 footprinting ( 13). 
Particularly striking was the sequence CTGGT (-26 to -22) 
which was identical with that defined as the part of the FadR 
operator in both the fadB and fadL genes (13). 2 In order to 
address whether this region was the FadR operator site of the 
fadD gene, a 353-base pair Sot/3A fragment containing this 



'DiRusso, C. C., Metzger. A. K., and Heimert. T. L. (1993) Moi 
Micro., in press. 

acid sequence of acyl-CoA synthetase determined by automated pro- 
tein sequencing. The inverted repeat followed by 8 thymidine residues 
that may represent the p-independent terminator is between nucleo- 
tides 2010 and 2035 and is underlined. 
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region was ligated into Ml3mpl8, M13mpl9, and pUCl8 as 
described under "Experimental Procedures." Using DNA- 
protein gel retention assays, this fragment was shown to bind 
FadR with high affinity indicating it contained a FadR bind- 
ing site (Fig. 6\4 ). The apparent of this fragment for FadR 
was estimated to be I x 10"° M. DNase I footprinting defined 
two FadR operator sites within the fadD promoter-containing 
fragment in pN330 (Fig. 6, B and C). The first FadR operator 
site, designated Od 1( covered 17 base pairs and included the 
sequence defined above at position -29 to -13. The second 
site at position -115 to -98 has been designated (W Within 
Od2 was the sequence CTGGT which was also found in Odi 
as well as the fadB and fadL operator sites. Overall, Od2 had 
9 of 17 nucleotides in common with the proposed FadR 
consensus binding site (13). 2 The region protected by FadR 
from DNase I digestion at Od2 was identified at concentrations 
of FadR that were 10-50-fold higher that that used to identify 
Odi indicating this site had a low binding affinity for FadR 
compared to 0» and O m . 

Comparison of the E. coli Acyl-CoA Synthetase with the Rat 
Acyl-CoA Synthetase, Yeast Acyl-CoA Synthetase, and Firefly 



A . 

ECACS 200 
FAA1 262 
RAT ACS 263 




FlG. 6. DNA-protein gel retention assays and DNase I foot- 
printing of the fadD* promoter demonstrating FadR binding. 

A, DNA-protein gel retention assay of the 4 14 -base pair fragment 
containing the fadD* promoter; lane /, DNA (1 x 10" 12 M of the [a- 
•^PldATP-labeied HinMhEcoRl fragment) with no added FadR; 
lanes 2-1 1 represent 1 x 10~ ,2 M | a - 32 P]dATP- labeled Hindlll-EcoRl 
fragment and increasing concentrations of FadR; 2, 5 x 10~* M FadR; 
:i 1 x 10" R M FadR; 4, 5 x 10"° M FadR; 5, 1 X 10" 9 M FadR; 6\ 5 X 
lO" 10 M FadR; 7, 1 X lO" 10 M FadR; 8, 5 X KT n M FadR; 9, 1 x HT n 
M FadR; I0 t 5 X 10~ ,a m FadR; //, 1 x 10~ 12 M FadR; a refers to the 
DNA fragment that is not FadR-bound, and 6 represents the FadR- 
DNA complex; the higher molecular weight complex observed in lane 
2 is seen only at high concentrations of FadR exceeding I X 10""* M. 

B, DNase I footprint of the top strand. C, DNase I footprint of the 
bottom strand. For both B and C, G, A, T, and C represent the fadD* 
sequence defined using the EcoR\ and HindlU primers, respectively, 
and lanes /, 2, and 3 represent decreasing concentrations of FadR (1 
x 10"' M, 2 x 10"* m, and 4 x 10~° M); lane 4 represents the DNase I 
pattern generated in the absence of added FadR; The fadD* operators 
indicated hy the open boxes are noted as Oi>, and (W 




B. 

ECACS 353 
FAA1 452 
RA7ACS 4 55 
FFLUC 336 



*V*Q Elf 




THE 



. T TUL 



T~t Li 
c c c 




Fig. 7. Homology between the E. coli acyl-CoA synthetase 
{ECACS), the rat liver acyl-CoA synthetase {RATACS), the 
yeast acyl-CoA synthetase (FAA1), and the firefly luciferase 
{FFLC). A, similarities hetween ECACS, FAA1, and RATACS cor- 
responding to amino acid residues 200-273 of the E. coli enzyme. B, 
similarities between ECACS, FAAI, RATACS, and FFLC corre- 
sponding to amino acid residues 353-455 of the E. coli enzyme. 
Regions with amino acid identity are boxed and shaded with dark 
gray, and those regions with high homology are boxed and shaded 
with light gray. 

Luciferase — Using the Genetics Computer Group programs 
BESTFIT and GAP, we compared the deduced amino acid 
sequence from the E. coli acyl-CoA synthetase to that deduced 
for the rat (32) and the yeast enzymes (33) and firefly lucif- 
erase (34). These four enzymes were found to have extensive 
similarities along their entire lengths (48-51% sequence sim- 
ilarity when conservative amino acid substitutions are consid- 
ered). Overall these four enzymes were 24-27% identical. 
Further analysis of these data indicated that two regions of 
these enzymes were highly conserved (Fig. 7). In the first 
region (residues 200-273 of the E, coli acyl-CoA synthetase), 
there was a 32-35% sequence identity which was extended to 
53-67% similarity when conserved residues were included to 
residues 255-327 of the yeast enzyme and residues 262-334 
of the rat enzyme. These was no apparent similarity observed 
between the firefly luciferase and the three acyl-CoA synthe- 
tases in this region. In the second region (amino acid residues 
353-455 of the E. coli enzyme), 34-44% of the amino acid 
residues were identical and 60-65% were similar for all four 
enzymes (Fig. IB). Suzuki et al. (32) proposed that for the rat 
enzyme, this second region may represent the ATP binding 
site. Part of our current research efforts are being directed at 
addressing whether or not this region of the E. coli acyl-CoA 
synthetase actually represents a nucleotide binding region. 

DISCUSSION 

In the present paper, we report the cloning, sequencing, and 
expression of the fadD gene of E. coli encoding acyl coenzyme 
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A synthetase. The fadD gene was identified in clone 6D1 from 
the Kohara gene library and subsequently shown to be en- 
coded within a 2.2-kb Ncol-Clal fragment of genomic DNA 
by complementation analysis. The expression of the fadD 
gene was monitored both by following acyl-CoA synthetase 
activities in the collection of fadD and fadD* plasmids and by 
following induction of the fadD* gene using T7 RNA polym- 
erase. Acyl-CoA synthetase levels were only 2-fold inducible 
in the presence of the long-chain fatty acid oleate which 
differed from the levels of induction observed for other fad 
gene products (acyl-CoA dehydrogenase, enoyl-CoA hydra- 
tase, 0-hydroxyacyl-CoA dehydrogenase, and 0-ketothiolase) 
(5, 26). The DNA sequence of the fadD gene predicted a 
protein with 558 amino acid residues and a molecular weight 
of 62,028 starting with UUG as the translational initiation 
codon. This alignment was confirmed by N -terminal amino 
acid sequence analysis of purified acyl-CoA synthetase. No 
evidence was obtained which indicated the acyl-CoA synthe- 
tase was post-translationally processed (Le. the presence of a 
signal sequence and/or N-terminal amino acid modification). 
The transcriptional initiation site of the fadD gene was deter- 
mined to be an adenine residue 60 nucleotides upstream from 
the initiation site of translation using primer extension of two 
different /odD-specific oligonucleotides. The T7 RNA polym- 
erase experiments estimated the size of the E. coli acyl-CoA 
synthetase to be 62,000 which was in agreement with that 
deduced from the DNA sequence and that defined from earlier 
reports. 

The fadD gene contained two operator sites for the binding 
of FadR. The first (Odi) was slightly upstream (-29 to -13) 
from the transcriptional start and had a relatively high affin- 
ity for FadR (K^ ~ 1 x 10~ e m). The estimated affinity of the 
fadD operator Odi toward FadR (-1 x 10" 8 M) was nearly an 
order of magnitude lower than that defined for the fadB 
promoter (3 X 10" 10 m) (13). This operator site was appropri- 
ately positioned to block transcription when filled by FadR 
as it overlapped the presumptive —10 region. The second 
operator site (Odj) was found 114 base pairs upstream from 
the transcriptional start (-115 to -99). This site had consid- 
erably less affinity toward FadR (K«, £ 1 X 10" 8 M) as 
estimated using DNase I footprinting. Studies are being con- 
ducted to determine the precise contribution of this site as 
well as the contribution of Odi in the expression of the fadD 
gene. 

DiRusso and her colleagues (13) demonstrated that the 
long-chain fatty acyl-CoA molecule is the inducer of the fatty 
acid degradative genes by showing that inclusion of these 
compounds (in nanomolar concentrations) prevented FadR 
binding to the fadB operator site in DN A-protein gel retention 
assays while long-chain fatty acids did not (13). Due to the 
affinity of 0 Dl , it is expected that when this site is filled, 
transcription of fadD 4 " is likely to be turned off or maintained 
at a low basal level. The role of 0 D 2 is less clear; perhaps this 
second site regulates a second promoter upstream from the 
primary promoter identified here. Alternatively, there may be 
cooperative interaction between proteins bound at Odi and 
Od2 that contribute to enhanced repression of the fadD pro- 
moter. When fatty acids are present in the growth media, the 
acyl-CoA synthetase enzymatically produces an increased in- 
tracellular pool of long-chain fatty acyl-CoA molecules that 
results in the derepression of transcription. The net result is 
coordinate induction of transcription of the genes involved in 
fatty acid transport, activation, and degradation, including 
fadD. 

The presence of a UUG translation initiation codon for 
acyl-CoA synthetase was noted in the course of the present 



study. This initiation codon is relatively rare (found in 1% of 
E. coli genes) and in some cases acts to down- regulate the 
expression of a given protein (30). It is plausible that the 
production of acyl-CoA synthetase may also be down-regu- 
lated. In the case of the rnd gene of E. coli (encoding RNase 
D), replacement of the native UUG with AUG results in an 
11-fold increase in RNase D expression (34). We are presently 
investigating whether the acyl-CoA synthetase activity is 
subject to comparable regulation. 

Acyl-CoA synthetase is crucial for the uptake of exogenous 
long-chain fatty acids that are destined to be utilized as a 
source of carbon and metabolic energy. This enzyme has been 
proposed to vectorially transport long-chain fatty acids across 
the inner membrane with a concomitant thioesterification to 
the CoA derivatives. Acyl-CoA synthetase functions by gen- 
erating a fatty acid- adenylate intermediate which in turn is 
converted into a fatty acyl-CoA. In this respect, this enzyme 
is likely to bind ATP (see "Discussion" below) and thus may 
be functionally analogous to the ATPase component of bac- 
terial permeases that represents class of transport proteins 
collectively referred to as "traffic ATPases" (36). There is 
evidence that suggests a H + /long-chain fatty acid co-trans- 
porter is present in the inner membrane (6, 7). If this is the 
case, this postulated component must interact directly with 
the acyl-CoA synthetase in the vectorial transport of long- 
chain fatty acids. 

In the course of our analysis of the fadD gene, we compared 
the deduced amino acid sequence of the E. coli acyl-CoA 
synthetase to the rat liver acyl-CoA synthetase, the yeast 
acyl-CoA synthetase, and firefly luciferase. Although the four 
proteins shared significant similarity along their entire 
lengths, a higher degree of similarity in the amino acid se- 
quences from these four proteins was identified toward their 
carboxyl ends (amino acid residues 353-455 of the E. coli 
enzyme). Due to the common mechanism of action of these 
proteins, it seems likely that this region may be of functional 
importance. Suzuki et al (32) proposed that this region of the 
rat acyl-CoA synthetase represented the ATP binding do- 
main. This proposal is, in part, based on the similar enzyme 
mechanism proposed for firefly luciferase. The firefly lucif- 
erase reaction, like the acyl-CoA synthetase reaction, proceeds 
by a two-step mechanism that results in the formation of an 
adenylated intermediate. In both the firefly luciferase and the 
acyl-CoA synthetase enzyme mechanisms, there is a reaction 
between the carboxyl group of the substrate (lucifern and 
long-chain fatty acid, respectively) and ATP to form the 
adenylated intermediate. In both cases, this activation reac- 
tion requires the pyrophosphorolysis of ATP. The formation 
of an adenylated intermediate requires that ATP bind tran- 
siently to the enzyme as part of the catalytic cycle (8, 32). 
The second region of the E. coli acyl-CoA synthetase that 
shared similarity with the yeast and rat enzymes was more 
centrally located along the linear amino acid sequence of 
these proteins (amino acid residues 200-273 of the E. coli 
enzyme). As this homology was not seen with the firefly 
luciferase, this region may specify a component unique to 
acyl-CoA synthetases (e.g. fatty acid binding domain and/or 
the coenzyme A binding domain). The significance of these 
similarities with respect to the function of the E. coli acyl- 
CoA synthetase is presently under investigation. 
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Summary 

The alkBFGHJKL and alkST operons encode 
enzymes that allow Pseudomonas putfda (oleovo- 
rans) to metabolize alkanes. In this paper we report 
the nucleotide sequence of a 4592 bp region of the 
alkBFGHJKL operon encoding the AlkJ, AlkK and 
AlkL polypeptides. 

The alkJ gene encodes a protein of 59 kllodaltons. 
The predicted amino acid sequence shows signifi- 
cant homology with four flavin proteins: choline 
dehydrogenase, a glucose dehydrogenase and two 
oxidases. AlkJ is membrane-bound and converts 
aliphatic medium-chain-length alcohols into aldehy- 
des. The properties of AlkJ suggest that It Is linked to 
the electron transfer chain. AlkJ is necessary for 
growth on alkanes only in P. putida alcohol dehydro- 
genase (AIcA) mutants. 

AlkK is homologous to a range of proteins which 
act by an ATP-dependent covalent binding of AMP to 
their substrate. This list Includes the acetate, 
coumarate and long-chain fatty acid CoA ligases. The 
alkK gene complements a fadD mutation In 
Escherichia coll, which shows that it indeed encodes 
an acyl-CoA synthetase. AlkK is a 60 kllodalton pro- 
tein located in the cytoplasm. 

AlkL is homologous to OmpW, a Vibrio cholerae 
outer membrane protein of unknown function, and a 
hypothetical polypeptide encoded by yff4 In E. coll. 
AlkL, OmpW and Ytt4 all have a signal peptide and 
end with a sequence characteristic of outer mem- 
brane proteins. The alkL gene product was found In 
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the outer membrane of £. coll W3110 containing the 
a/Jr-genes. The alkL gene can be deleted without a 
clear effect on growth rate. Its function remains 
unknown. 

The G+C content of the alkJKL genes Is 45%, identi- 
cal to that of the atkBFGH genes, and significantly 
lower than the G+C content of the OCT-plasmid and 
the ft putida chromosome. 



Introduction 

The OCT-plasmid (Chakrabarty etal. t 1973) allows Pseu- 
domonas oleovorans to utilize alkanes as sole carbon 
and energy source. Two operons located on this plasmld; 
alkBFGHJKL (Eggink et ai, 1 987a; Kok et ai, 1 989b) and 
alkST (Eggink et ai, 1988; 1990) encode the enzymes 
necessary to convert alkanes to fatty acids. 

The first step in alkane catabollsm; hydroxylation of 
alkanes to alkanols, is catalysed by the alkane hydroxy- 
lase system (Peterson et a/., 1966). This enzyme system 
consists of the alkB, alkG and alkT gene products alkane 
hydroxylase, rubredoxin and rubredoxin reductase (Kok 
et ai, 1989a,b; Eggink et ai, 1990). The alkH gene 
encodes an NAD-dependent aldehyde dehydrogenase 
(Kok et ai, 1989b) # and AlkS regulates expression of the 
alkBFGHJKL operon (Eggink et ai, 1988). AlkF is a non- 
functional rubredoxin (Kok etai, 1989b). 

The distal part of the operon, earlier known as alkC 
(Owen etai, 1984), encodes three peptides identified by 
Eggink etai (1987a), and designated AlkJ (58 kDa), AlkK 
(59 kDa) and AlkL (20 kDa) (Kok et ai, 1989b). Marker 
rescue experiments of alcohol dehydrogenase mutants 
(Owen et ai, 1984) mapped the alcohol dehydrogenase 
function to the alkJ gene. The alcohol dehydrogenase 
activity was lost in pGEc41 where the distal three cistrons 
of the alkBFGHJKL operon were deleted (Eggink et ai, 
1987b). No mutations that affect alkane utilization have 
been mapped to the alkK and alkL genes. The above data 
are summarized in Fig. 1 A and B. 

In this paper we present the nucleotide sequence of the 
alkJKL region and provide evidence for the role of the 
peptides encoded by this region in alkane oxidation. 
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Results 

Nucleotide sequence of the alkJKL region 

The nucleotide sequence of the alkJKL region was deter- 
mined following the strategy depicted in Fig. 1E. Three 
open reading frames (ORFs) were found (Fig. 2). The 
position and length of these ORFs, the predicted sizes of 
the corresponding encoded peptides, potential ribosomal 
binding sites and G+C content are shown in Table 1 . No 
other large ORFs could be detected downstream of the 
AlkL ORF in a sequence of 377 nucleotides. The 



Fig. 1. Genetic structure of the alkJKL region and 
strategy in determining its sequence. 

A. Previous genetic experiments indicate that alkj 
encodes an NAD-independent dye-Qnked alcohol 
dehydrogenase. Triangles indicate Tn 7 insertions 
which inactivate the alcohol dehydrogenase 
(FennewakJ era/.. 1979), and boxes indicate the 
approximate positions of two-point mutations In 
the alcohol dehydrogenase gene (as determined 
by Owen et al., 1984 and relative to the DNA 
sequence published by Kok era/., 1989a,b). 

B. Location of 3' end of the insert of pGEc41 . 

C. Position of relevant restriction sites. Previous 
R-loop experiments indicated that the mRNA 
ends around position 8100, pfus or minus 600 bp 
(Eggink et al, 1987a). 

D. Location of the atkJ. alkK, alkL and the 
neighbouring affcWopen reading frames, the 
inferred molecular masses ( xl 0" 3 ) of AlkH, AlkJ, 
AlkK and AlkL, and (in parenthesis) the apparent 
molecular masses ( x 10" 3 ) of these peptides, as 
determined in miniceli expression experiments 
(Eggink etal, 1987a). 

E. Sequence strategy. Part of the sequence was 
determined by primer walking (vertical bars). The 
arrows indicate the direction and extent of 
sequence determination of individual clones. 

sequences of the alkBFGHJKL operon and its translation 
products AlkJ, AlkK and AlkL have been submitted to the 
EMBL Data Library, and are available under accession 
number X65936 (P. oteovorans genomic DNA + OCT 
plasmid). 

Comparison of sequence information to previous R-loop 
and peptide mapping experiments 

We have previously analysed the alkBFGHJKL operon for 
translation products in E. coli minicells (Eggink et al., 



Table I. Properties of the atkJ, alkK and alkL 
genes and gene products. 



afkJ 
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Sequence 
Coding region 
Size of protein 



4548-6221 

558 residues (60.9 kDa) 



G+C content of region 45.6% 

Ribosome-bJnding site CGAGAA(6)AUQ 

Distance to preceding 4 1 bp 
open-reading-frame 



Minicel? experiments' 
Size of protein" 
Mapped between 6 



58 kDa 
4500-6150 



Gsnstic and biochemical data 
Size of protein 59 kDa 

atk mutations 0 alkC1026, alkCl038 

Function Alcohol dehydrogenase 

Localization Cytoplasmic membrane 

(peripheral) 



alkK 



alkL 



6284-7921 8026-8715 
546 residues (59.3 kDa) 230 (203) residues 

25.0 (22.1) KDa 



45.4% 

UGAGG(7)AUG 
63 bp 



43.0% 

CGAGGG(7)AUG 
104 bp 



59 kDa 

61 50-8000 (San) 



60 kDa 
None 

Acyl-CoA synthetase 
Cytoplasm 



20 kDa 

7921 (H/ndlll)-9092 



25 kDa 
None 
Unknown 
Outer membrane 



a. Eggink etal (1987a). 

b. based on co-ordinates and restriction sites in Eggink ef al. (1987a). 

c. Owen etal. (1984). 
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4506 4548 

GTT AAG TAT TTG ACC TGAGTAATTTTTTCCATAAATCATTACCTCCACAAGATAAA ATG TAC GAC TAT 455 

Val Lys Tyr Leu Ser MET Tyr Asp Tyr 4 

ATA ATC GTT GGT GCT GGA TCT GCA GGA TGT GTG CTT GCT AAT CGT CTT TCG GCC GAC CCC 461 

lie He Val Gly Ala Gly Ser Ala Gly Cys Val Leu Ala Asn Arg Leu Ser Ala Asp Pro 24 

TCT AAA AGA GTT TGT TTA CTT GAA GCT GGG CCG CGA GAT ACG AAT CCG CTA ATT CAT ATG 467 

Ser Lye Arg Val Cye Leu Leu Glu Ala Gly Pro Arg Asp Thr Asn Pro Leu lie His Met 44 

CCG TTA CGT ATT GCT TTG CTT TCA AAT ACT AAA AAG TTG AAT TGG GCT TTT CAA ACT GCG 473 

Pro Leu Gly He Ala Leu Leu Ser Asn Ser Lys Lys Leu Asn Trp Ala Phe Gin Thr Ala 64 

CCA CAG CAA AAT CTC AAC GGC CCG AGC CTT TTC TGG CCA CGA GGA AAA ACG TTA GGT GGT 479 

Pro Gin Gin Asn Leu Asn Gly Arg Ser Leu Phe Trp Pro Arg Gly Lys Thr Leu Gly Gly 84 

TCA AGC TCA ATC AAC GCA ATG GTC TAT ATC CGA GGG CAT GAA GAC GAT TAC CAC GCA TGG 485 

Ser Ser Ser He Asn Ala Met Val Tyr He Arg Gly His Glu Asp Asp Tyr His Ala Trp 104 

GAG CAG GCG GCC GGC CGC TAC TGG GGT TGG TAC CGO GCT CTT GAG TTG TTC AAA AGO CTT 491 

Glu Gin Ala Ala Gly Arg Tyr Trp Gly Trp Tyr Arg Ala Leu Glu Leu Phe Lys Arg Leu 124 

GAA TGC AAC CAG CGA TTC GAT AAG TCC GAG CAC CAT GGG GTT GAC GGA GAA TTA GCT GTT 497 v 

Glu Cye Asn Gin Arg Phe Asp Lys Ser Glu His His Gly Val Asp Gly Glu Leu Ala Val 144 ' 

AGT GAT TTA AAA TAT ATC AAT CCG CTT AGC AAA GCA TTC GTG CAA GCC GGC ATG GAG GCC 503 

Ser Asp Leu Lys Tyr He Asn Pro Leu Ser Lys Ala Phe Val Gin Ala Gly Met Glu Ala 164 

AAT ATT AAT TTC AAC GGA GAT TTC AAC GGC GAG TAC CAC GAC GCC GTA GGG TTC TAT CAA 509 

Asn He Asn Phe Asn Gly Asp Phe Asn Gly Glu Tyr Gin Asp Gly Val Gly Phe Tyr Gin 184 

GTA ACC CAA AAA AAT GGA CAA CGC TGG AGC TCG GCG CGT GCA TTC TTG CAC GGT GTA CTT 515 

Val Thr Gin Lys Asn Gly Gin Arg Trp Ser Ser Ala Arg Ala Phe Leu Hia Gly Val Leu 204 

TCC AGA CCA AAT CTA GAC ATC ATT ACT GAT GCG CAT GCA TCA AAA ATT CTT TTT GAA GAC 521 

Ser Arg Pro Asn Leu Asp He He Thr Asp Ala His Ala Ser Lys He Leu Phe Glu Asp 224 

CGT AAG GCG GTT GGT GTT TCT TAT ATA AAG AAA AAT ATG CAC CAT CAA GTC AAG ACA ACG 527 

Arg Lys Ala Val Gly Val Ser Tyr He Lys Lys Asn Met His His Gin Val Lys Thr Thr 244 

AGT GGT GGT GAA GTA CTT CTT AGT CTT GGC GCA GTC GGC ACG CCT CAC CTT CTA ATG CTT 533 

Ser Gly Gly Glu Val Leu Leu Ser Leu Gly Ala Val Gly Thr Pro His Leu Leu Met Leu 264 

TCT GGT GTT GGG GCT GCA GCC GAG CTT AAG GAA CAT GGT GTT TCT CTA GTC CAT GAT CTT 539 

Ser Gly Val Gly Ala Ala Ala Glu Leu Lys Glu His Gly Val Ser Leu Val His Asp Leu 284 

CCT GAG GTG GGC AAA AAT CTT CAA GAT CAT TTG GAC ATC ACA TTG ATG TGC GCA GCA AAT 545 

Pro Glu Val Gly Lys Asn Leu Gin Asp His Leu Asp lis Thr Leu Met Cys Ala Ala Asn 304 

TCG AGA GAG CCG ATA GGT GTT GCT CTT TCT TTC ATC CCT CGT GGT GTC TCG GGT TTG TTT 551 

Ser Arg Glu Pro He Gly Val Ala Leu Ser Phe He Pro Arg Gly Val Ser Gly Leu Phe 324 

TCA TAT GTG TTT AAG CGC GAG GGG TTT CTC ACT AGT AAC GTG CCA GAG TCG GGT GGT TTT 557 

Ser Tyr Val Phe Lys Arg Glu Gly Phe Leu Thr Ser Aen Val Ala Glu Ser Gly Gly Phe 344 

GTA AAA AGT TCT CCT GAT CGT GAT CGG CCC AAT TTG CAG TTT CAT TTC CTT CCA ACT TAT 563 

Val Lys Ser Ser Pro Asp Arg Asp Arg Pro Asn Leu Gin Phe His Phe Leu Pro Thr Tyr 364 

CTT AAA GAT CAC GGT CGA AAA ATA GCG CGT GGT TAT GGT TAT ACG CTA CAT ATA TGT GAT 569 

Leu Lys Asp His Gly Arg Lys He Ala Gly Gly Tyr Gly Tyr Thr Leu Hia He Cys Asp 384 

CTT TTG CCT AAG AGC CGA GGC AGA ATT GCC CTA AAA AGC GCC AAT CCA TTA CAG CCG CCT 575 

Leu Leu Pro Lys Ser Arg Gly Arg He Cly Leu Lys Ser Ala Asn Pro Leu Gin Pro Pro 404 

TTA ATT GAC CCG AAC TAT CTT AGC GAT CAT GAA GAT ATT AAA ACC ATG ATT GCG GGT ATT 581 

Leu He Asp Pro Aan Tyr Leu Ser Asp His Glu Asp He Lys Thr Met He Ala Gly He 424 

AAG ATA GGG CGC GCT ATT TTG CAG GCC CCA TCG ATG CCG AAG CAT TTT AAG CAT GAA GTA 587 

Lys He Gly Arg Ala He Leu Gin Ala Pro Ser Met Ala Lys Hie Phe Lys His Glu Val 444 

GTA CCG GGC CAG GCT GTT AAA ACT GAT GAT GAA ATA ATC GAA GAT ATT CGT AGG CGA GCT 593 

Val Pro Gly Gin Ala Val Lys Thr Asp Asp Glu He He Glu Asp He Arg Arg Arg Ala 464 

GAG ACT ATA TAC CAT CCG GTA GCT ACT TCT AGC ATG GGT AAA GAT CCA GCG TCA GTT GTT 599 

Glu Thr He Tyr His Pro Val Gly Thr Cys Arg Met Gly Lys Asp Pro Ala Ser Val Val 484 

GAT CCG TGC CTC AAG ATC CGT GGG TTC CCA AAT ATT AGA GTC GTT GAT GCG TCA ATT ATG 605 
Asp Pro Cys Leu Lys He Arg Gly Leu Ala Asn He Arg Val Val Asp Ala Ser He Met 504 

Rg. 2. 
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CCG CAC TTG GTC GCG GGT AAC ACA AAC GOT CCA ACT ATT ATG ATT GCA GAA AAT GCG GCA 611 
Pro Hia Leu Val Ala Gly Aon Thr Aon Ala Pro Thr lie Mat Ilo Ala Glu Aon Ala Ala 524 

GAA ATA ATT ATG CGG AAT CTT GAT GTC GAA GCA TTA GAG GCT AGC GCT GAG TTT CCT CGC 617 
Glu He He Met Arg Asn Leu Asp Val Glu Ala Leu Glu Ala Sec Ala Glu Phe Ala Arg 544 

6221 

GAG GGT GCA GAG CTA GAG TTG GCC ATG ATA GCT GTC TCC ATG TAAAAAACATGGTCAATAGATGG 624 
Glu Gly Ala Glu Leu Glu Leu Ala Met He Ala Val Cye Met 558 

6284 



TTTTTTAATCAACATAAATCATCAATGTGAGGCGACCTG ATG TTA GGT CAG ATG 

MET Leu Gly Gin Met 


ATG 
Met 


CGT 
Arg 


AAT 
Asn 


CAG 
Gin 


TTG 
Leu 


631 
10 


GTC 
vai 


ATT 
lie 


GGT 
Gly 


TCG 

Cam 

ser 


CTT GTT GAG CAT 

T At* If A 1 m*9\ mm U 1 mm 

Leu val Glu His 
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Glu 
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Val 


TCA 
Ser 


637 
30 
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Glu 


CTT 
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TCT 

OBIT 


GCA TTG GGC AAG 
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649 
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Tin « III a » a«i 
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Gly 
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GGA 

Gly 


ATG 
Met 


GTA 

val 


655 
90 
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CAT 

mm • 
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ACA 
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ATC 
He 


AAT CCG AGO CTT 
Asn Pro Arg Leu 


TTC ATT GAG CAG ATC ACA TAT 

— _ mm — at — m — m — 

Phe He Glu Gin He Thr Tyr 


GTG 
Val 


ATA 
He 


AAC 
Asn 


CAT 
His 


GCG 
Ala 


661 
110 


GAG 
Glu 


GAT 
Asp 


AAG 
Lys 


GTA 
Val 


GTA CTT CTT GAT 
Val Leu Leu Asp 


GAT ACG TTC TTG CCA ATC ATT 
Asp Thr Phe Leu Pro He He 


GCT 
Ala 


GAG 
Glu 


ATT 
He 


CAC 
His 


GGT 
Gly 


667 
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TCG 
Ser 


TTA 
Leu 


CCA 
Pro 


AAA 
Lys 


GTC AAG GCG TTT 
Val Lya Ala Phe 


GTC TTG ATG GCT CAT AAT AAT 
Val Leu Met Ala His Asn Asn 


TCA 
Ser 


AAT 
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Ala 


TCT 
Ser 


CCT 
Ala 


763 
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CAA 
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ATG 
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Pro 
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Gly 
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Thr 
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val 
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Trp Gly Thr Pro 
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Ala 
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Val 


CTT 
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CCT. 
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GGT 
Gly 
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Ser 
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Glu 
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Ser 


ATG 
Met 
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AGA 
Arg 
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Glu 


TTT 
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AAC GAT ATA TAT 
Asn Asp He Tyr 


GGT GTT GAA GTT ATT CAT GCT 
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ATG 
Met 


GCG 
Ala 


745 
390 
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GTT ATT AAA GAT 
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751 
410 


GGT 
Gly 


TGG 
Trp 


TTT 
Phe 


TCA 
Ser 


ACT GCA GAC GTG 
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763 
450 



Fig. 2. 
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Arg 
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7921 






TCT 


TCT 


GTC 


TCT 


GAA 


TAA CCTTTCTGTAT 


793 



546 



GGGCTTTGACTCATTTTra^ 8Q1 

fiGGSTACCACA ATG AGT TTT TCT AAT TAT AAA GTA ATC GCO ATG CCG GTG TTG GTT GCT AAT 807 

MET Ser Phe Ser Aen Tyr Ly. Val lie Ala Met Pro Val Leu Val Ala Aen 17 

TTT CTT TTG GGG GCG GCC ACT GCA TGG GCG AAT GAA AAT TAT CCG GCG AAA TCT GCT GGC 813 

Phe Val Leu Gly Ala Ala Thr Ala Trp Ala Asn Glu Aan Tyr Pro Ala Lye Ser Ala Gly 37 

™l ™l 5? T 9*° 700 0TC CCT AGC TTC AAT TTT TCT AAG GTC TAT GTG GOT GAG GAG 819 

Tyr Asn Gin Gly Aep Trp Val Ala Ser Phe Asn Phe Ser Lys Val Tyr Val Gly Glu Glu 57 

CTT GGC GAT CTA AAT GTT GGA GGG GGG GCT TTG CCA AAT GCT GAT GTA AGT ATT GGT AAT 825 

Leu Gly Aep Leu Asn Val Gly Oly Gly Ala Leu Pro Asn Ala Asp Val Ser lie Gly j£n 11 

ASD Thr 5°° ll T S AT tT° GCC TAT TTT CTT ACC TCA ATA GC0 CTO GAT TTT 831 

Asp Thr Thr Leu Thr Phe Asp lie Ala Tyr Phe Val Ser Ser Asn He Ala Val Asp Phe 97 

?™ 2 1 ? S CA GCT AGC GCT *** TTT CAA CGT GAG AAA TCA ATC TCC TCG CTG GGA 837 

Phe Val Gly Val Pro Ala Arg Ala Lye Phe Gin Gly Glu Lys Ser He Ser Ser Leu Gly 117 

too VaT Ser lu> SIT t hT I AC S? C CCT 6CA CTT TCG CTT CAA TAT CAT TAC GAT AGC 843 

Arg Val Ser Glu Val Asp Tyr Gly Pro Ala He Leu Ser Leu Gin Tyr His Tyr Asp Ser 137 

17* GA0 C0A CTT TAT CCA TAT GTT GGC GTT CCT CTT GCT CCG GTC CTA TTT TTT GAT AAA 849 

Phe Glu Arg Leu Tyr Pro Tyr Val Gly Val Gly Val Gly Arg Val Leu Phe ill Asp 151 

n AC 2? T ? CT TTG AGT TCO TTT CAT ATT AAG GAT AAA TGG GCG CCT GCT TTT CAG GTT 8S5 

Thr Asp Gly Ala Leu Ser Ser Phe Asp He Lye Asp Lys Trp Ala Pro Ala Pne £n VaT 171 

GCC CTT AGA TAT GAC CTT GGT AAC TCA TGG ATG CTA AAT TCA CAT CTG CGT TAT ATT CCT 861 

Gly Leu Arg . Tyr Asp Leu Oly Aen Ser Trp Met Leu Asn Ser Asp Val Arg Tyr lie Pro 197 

He SOS ?E Aan SI? S 0 * % T £u T f TT °? C CCG GTT CCT CTT TCT ACT ATT GAG GTT 867 

Phe Lys Thr Asp Val Thr Gly Thr Leu Gly Pro Val Pro Val Ser Thr Lys He Glu Val 217 

8715 

Asd So tTI J™ « CT f 17 °? T GC ° TCA TAT CTT TTC TAA GTAATCAGGTCTGTCACTGTCGC 874 

Asp Pro Phe He Leu Ser Leu Oly Ala Ser Tyr Val Phe 230 

AGCACTCTTTCCATCCATTAGTGTGCCGTCTCACAAATAATGCT 882 

CTGTATCTTGTTCAAAATATCTTCCCCCTTCGCGTTCCAGACGTAGCGAGTCGGCTGCGCATTT^ 889 

G TGGTAATCG ACCTTTCC AGTTCG CG AACCG AACTG AAACTGCC ATC ACG CAGGT AC ACCGTG ATATCGCG AAAG AAG C 897 

GTTCAACCATGTTCATCCACGAACTCGAGGTCWGOOTGAAATGCATCTGGAAGCGTTTGT^ 905 
CACTTTCGCATCCTTGTGCGTGGCGTACTTGTCGA 9092 

mL^Iif 1 !?i ide ^3 UGnC8 ? me aBUKL rGfll0n - ™ S continues that reported in Kok el a/. (1 989a.b). Nucleotide positions of the first and last 

nucieot de ot the ORFs are shown above the sequence. Nucleotide and amino acid positions are shown to the right of the sequences. The putatve Shine 
and Dalgarno boxes are underlined (Shine and Dalgamo, 1975). These sequence data appear in the EMBUGenBank/DDBJ Nucleotide Sequence Data 
Libraries under the accession number X65936. 



1987a). Starting from the promotor the detected polypep- 
tides had molecular masses of 41 (AlkB), 15 (AlkF), 49 
(AlkH), 58 (AlkJ), 59 (AlkK), and 20 kDa (AlkL), respec- 
tively. The positions of the ORFs and the calculated 
molecular masses of the peptides encoded by the alkJKL 
ORFs are shown in Fig. 1D. Both gene location and 
inferred peptide size are in accordance with the minicell 
experiments, except for a difference of 5 kDa in the 



molecular mass of AlkL, as can be seen in Table 1 . 

According to R-loop experiments the alkBFGHJKL 
operon ends at position 8100 ± 600 bp (Eggink et a/., 
1987a). The most distal position (8700) is close to the end 
of AlkL ORF at position 8715. 

In our earlier minicell experiments the alkj cistron 
encoded two peptides with molecular masses of 58 kDa 
or 37 kDa (Eggink et a/., 1987a). The 37 kDa peptide 
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i — Mkj 



i — BetA 

I GLD 

— AOX 

GOD 

Fig. 4. Dendrogram of the alignment of AlkJ with dehydrogenases and 
oxidases. The distance to the branching points Is a measure for relative 
evolutionary distance. The figure was constructed using the pc/gene pro- 
gram CLUSTAL. 

was clearly derived from the 58 kDa peptide, as shown 
by limited proteolysis with Staphylococcus aureus V8 
protease. Subsequent experiments have shown that In 
* a number of plasmids, all derived from pGEc59 and 
pGEc60, a deletion of one base (A) from a stretch of As 
had occurred, resulting in an ORF of 371 amino acids 
(40.7 kDa). Plasmids pGEcSO and pGEc61 were not 
derived from pGEc59 or pGEc60 and produced a 58 kDa 
protein in minicell experiments (Eggink et a/., 1987a). The 
sequence of Fig. 2 shows that this region encodes a 
polypeptide with a calculated molecular mass of 60.9 
kDa. 



Nucleotide composition and codon usage of the alkJKL 
genes 

The average G+C content of the alkJKL genes is 45 %, 
which is comparable with that of the alkBFGH genes 
(46-^7 %) (Kok et a/., 1989b), but much lower than the 
62-67 % found for the P. putida chromosome (Mandel, 
1966) and the lncP2 plasmids including OCT (Fennewald 
et a/., 1978). The region following the alkL stop codon 
shows an increase in G+C content to.54 %, clearly higher 
than the G+C content of the alk genes. Together with the 
absence of an ORF downstream of AlkL, and the R-loop 
experiments (Eggink era/., 1 987a), this indicates that alkL 
is indeed the last gene of the alkBFGHJKL operon. 

The unusual nucleotide composition is reflected in the 
codon usage of these genes. Unlike most Pseudomonas 



genes the a//c-genes show no preference for G+C over 
A+T at the wobble position (data not shown). As this does 
not prevent AlkB from being produced to high levels in 
Pseudomonas and E. colt, the unusual codon usage 
apparently does not affect translation significantly. 

Primary structure of AlkJ 

We compared the primary structure of AlkJ with the pro- 
tein sequences compiled in the Protein Sequence Data 
Base (Dayhoff et a/., 1983; Lipman and Pearson, 1985). 
AlkJ is homologous with E. coli choline dehydrogenase 
(BetA; Lamark etai t 1991), Drosophila melanogasterand 
D. pseudoobscura glucose dehydrogenase (GLD; Kras- 
ney ©fa/., 1990), Hansenuta polymorphs alcohol oxidase 
(AOX; Ledeboer et a/., 1985) and Aspergillus niger glu- 
cose oxidase (GOD; Frederick et a/., 1990). An alignment 
of these sequences, and the homology to AlkJ expressed 
as a percentage positional sequence identity, is shown in 
Fig. 3. The relative evolutionary distance between these 
proteins is shown as a dendrogram in Fig. 4. 

All of these proteins, including AlkJ, possess a charac- 
teristic fingerprint for ADP binding (Wierenga etal., 1986) 
at, or close to, their amino terminus (Fig. 5). The two oxi- 
dases and glucose dehydrogenase are flavin proteins. In 
case of BetA this has not yet been determined. However, 
since these proteins are all NAD(P)-independent and 
possess the same fingerprint it is likely that BetA and AlkJ 
are flavin proteins as well. 

The ADP-binding fingerprint consists of 1 1 amino acid 
positions at which specific amino acids occur (Fig. 5). 
Sequences that contain the expected amino acids at only 
nine or 10 of the positions can still be ADP-binding pot|3- 
folds. Interestingly, AlkJ, BetA, GLD, AOX and GOD all 
have an aspartic acid residue at the first position of the 
fingerprint, a deviation from the consensus. In fact, we 
found that more than half of the flavin proteins in the 
Swiss-Prot database (Release 18) that have a recogniz- 
able ADP-binding fold (about 60), have an aspartate at 
this position. At the second position of the fingerprint, 
AlkJ, BetA and GOD have a tyrosine residue. This again 
is not in agreement with the consensus fingerprint, but it 
is also found in proteins such as the glutathione and 
mercury reductases. 
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Rg. 6. Comparison of amino acids 1 85-1 93 In 
AlkK and the conserved sequence in adenylate- 
forming enzymes (the putative AMP-binding pat- 
tern), with P-loop sequences In two protein fami- 
lies. The shaded residues are conserved between 
most of the sequences. X is any amino add. The 
underlined G is conserved between all different P- 
loops (Saraste et al, 1 990), but Is In most cases 
replaced by a proline in the putative AMP-blncfing 
pattern (Balroch, 1992). 



Phenotype of strains equipped with recombinant alk 
piasmids; function ofAlkJ 

The OCT-plasmid-encoded particulate alcohol dehydro- 
genase AIcO was studied by Benson and Shapiro (1976) 
and localized to the alkC region of the alkBAC operon by 
marker-rescue experiments (Owen et ai t 1984). The 
mutations and transposon insertions that affect the 
expression of the alcohol dehydrogenase are shown in 
Fig. 1A. In minicell experiments the alcohol dehydroge- 
nase function appeared to correspond to a 58 kDa 
polypeptide encoded by alkC (Eggink et a/., 1987a), 
which was later named AlkJ (Kok et a/., 1989b). To con- 
firm this assignment we deleted a large part of the alkJ 
ORF. P. putida and E. coli strains containing recombinant 
alk piasmids were plated on minimal medium and tested 
for growth on octane or 1-octanol vapour. Table 2 shows 
the results obtained for strains carrying pGEc47AJ, wild- 
type P. oieovorans (GPo1) and recombinant strains 
described earlier . in Eggink et ai. (1987b). Like pGEc41 
(pGEc47AJKL), pGEc47AJ is unable to complement the 
alcohol dehydrogenase mutations of P. putida PpS81 and 
PpS597, while pGEc47 allows these mutants to grow on 
n-octane. This shows that alkJ encodes the alcohol dehy- 
drogenase function. 

The growth on octane of E. coli GEc137 carrying 
pGEc47AJ is similar to that of E. coli GEc137 carrying 
pGEc47 t and somewhat faster than that of E. co//GEc1 37 
carrying pGEc41. This indicates that in E. coli AlkJ is not 
essential for growth on alkanes. 

Alcohol dehydrogenase activity of recombinant strains 

Since AlkJ showed homology to two oxidases, we tested 
the oxidase activity of P. putida GPo12 and E. coli 
GEc137, carrying pGEc47 and pGEc47AJ. using a perox- 
idase/2,2'-a2ino-bis(3-ethylbenzthiazoline-6-sulphonic 
acid (ABTS) assay (Verduyn etaL, 1984). However, none 
of these E. coli or P. putida recombinants showed produc- 
tion of H 2 0 2 , even after prolonged incubations. The pres- 
ence of 1 mM Na-azide to inhibit catalase activity did not 
affect the outcome of the test. 

In previous studies Benson and Shapiro (1976) found 
that AIcO (now AlkJ) activity is NAD(P) independent, is 



located in the pellet after centrifugation at 48000 x g, and 
that it could be assayed using the artificial electron accep- 
tor phenazine methosulphate (PMS). It has these proper- 
ties in common with choline dehydrogenase (BetA) of E. 
co//, which shows 37% identity with AlkJ. BetA is strictly 
0 2 -dependent, but it does not produce H 2 0 2 . When Triton 
X-100 is added the 0 2 -dependent activity of BetA is 
destroyed. However, the solubilized enzyme still cataly- 
ses the oxidation of its substrate in the presence of PMS. 
This has led the authors to propose that BetA is electron 
transfer-linked (Lamark et a/., 1 991 ). We therefore carried 
out experiments to determine whether AlkJ is electron 
transfer-dependent as well. 

Total membranes were isolated from several of the 
strains described above. The membrane preparations 
were assayed for PMS/2,6-dichlorophenol-indophenol 
(DCPIP)-dependent activities, and for O z -dependent 
alcohol dehydrogenase activity using a Biological Oxygen 
Monitor. Under these conditions the half-life of AlkJ activ- 
ity Is less than 30 min, and unlike BetA, AlkJ is completely 
inactive with PMS when Triton X-100 is added to a con- 
centration of 1 %. The AlkJ activity was « 0.4 - 1 .0 U mg*" 1 
protein in several experiments, whether PMS/DCPIP 
oxidation or 0 2 consumption was measured. In the 
absence of the a/fc/gene, oxidation levels were lowered 
to « 1 0 -1 5 %, similar to the effect of adding X-1 00. These 
results show that AlkJ does not transfer electrons from 
the substrate to a soluble cofactor, but to oxygen. Since 
no H 2 0 2 is produced, oxygen must be reduced to water, 
most likely through the electron transfer chain. 

Primary structure of AlkK 

Comparison of AlkK with the EMBL databank sequences 
revealed that AlkK shows clear amino acid similarity with 
a number of proteins that activate their substrate by cova- 
lently binding AMP to a carboxy acid group of the sub- 
strate. Turgay et al (1992) have constructed a phyloge- 
netic tree for this family of proteins which shows three 
major branches (a, b, c). Branch a contains two acetyl- 
CoA synthetases; branch b contains, among others, two 
4-coumarate CoA ligases (25 % sequence identity), long- 
chain fatty acid CoA ligase, firefly luciferase, and EntE (an 
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enzyme which activates 2,3-dihydroxybenzoate to the 
corresponding acyladenylate); and branch c contains 22 
domains of proteins that activate amino adds or amino 
acid analogues, and are involved in the synthesis of sec- 
ondary metabolites such as slderophores and antibiotics. 
AlkK Is most closely related to the 4-coumarate CoA lig- 
ases in branch b (25 % sequence identity). The similarity 
of AlkK to the other proteins or protein domains ranges 
from 1 7 to 24% identity. 

When all proteins or protein domains are aligned, very 
few residues are perfectly conserved. Only one small 
region is relatively well conserved in all proteins, including 
AlkK (amino acids 182-193), as was also found by 
Scholten et a/. (1991). The sequence pattern was found 
simultaneously by Bairoch (1992), who noted that it is 
similar to ATP-binding P-loops from different protein fami- 
lies (Saraste et a/. ( 1990), even though an invariant 
glycine is replaced by a proline (Fig. 6). As all proteins 
that are related to AlkK adenylate their substrates using 
ATP, this sequence motif may be involved in binding of 
ATP or AMP. Bairoch (1992), has added it to the 
PROSITE dictionary of sites and patterns found in protein 
sequences as PROSITE PS00455, which is a putative 
AMP-binding domain signature. 

Function of AlkK 

The alkBFGHJ/atkT genes have been shown to specify a 
complete pathway from alkane to fatty acid. Since P. 
putida is able to grow on fatty acids we have assumed 
previously that subsequent oxidation steps, starting with 
the activation of fatty acids to the corresponding acyl- 
CoAs, were encoded by the chromosome. However, the 
fact that AlkK is related to a number of acyl-CoA syn- 
thetases suggested that it might have a similar function. 

We tested whether the alkK gene complemented an 
acyl-CoA synthetase {fadD) mutation in £ coli. In this 
mutant the other fad genes can no longer be induced, 
because the inducer, which is either a long-chain acyl- 
CoA, or long-chain fatty acid, is not formed or transported 
into the cytosol. If AlkK is able to activate or transport 
long-chain fatty acids it should complement the fadD 
mutation completely. However, if AlkK is specific for 
medium-chain-length fatty acids, growth would not occur 
because the fad genes are not induced. In this case a 
fadR mutation, which causes constitutive expression of 
the fad genes, is necessary to obtain growth. Fortunately, 
these mutants appear very frequently when £ coli is 
plated on minimal media with decanoate as a sole carbon 
source (Overath ©fa/., 1969), or when £ coll strains car- 
rying pGEo47 are plated on minimal media with octane as 
the sole carbon and energy source (Eggink et a/., 1 987b). 

£ coli K27 {fadD) was transformed with plasmids 
pGEc47, pGEc41, pGEc47AK and pGEc47AJ. Plasmids 
pGEc47 and pGEc47AJ contain the alkK gene, whereas 



pGEc41 and pGEc47AK do not. Plasmid pGEc47AJ was 
included to test whether deletion of a part of alkJ still 
allows expression of alkK The recombinants were plated 
on minimal medium, and incubated in octane vapour, or 
plated on minimal medium containing 10 mM Na-oleate, 
and incubated in the presence of dicyciopropyiketone 
vapour to induce the alk system. Initially, very weak 
growth was observed with £ coliK27 (pGEc47) and K27 
(pGEc47AJ) on both types of media. No growth was 
observed with K27, K27 (pGEc47AK) and K27 (pGEc41). 
After about 7 d a limited number of large colonies 
appeared on the plates with K27 (pGEc47) and K27 
(pGEc47AJ) incubated in octane vapour. These colonies 
probably represented fadR mutants. The colonies were 
cured of their plasmid and restreaked on oleate-contain- 
ing plates to show that they did not represent revertants of 
the fadD mutation: no growth was observed. The cured 
strain K27 fadR, named GEc354, was then retransformed 
with all four plasmids, and plated on minimal media. 
GEc354 (pGEc47) grew well when incubated in octane 
vapour for 3 d. Growth of GEc354 (pGEc47AJ) was 
somewhat slower. No growth was observed for GEc354 
(pGEc41) and GEc354 (pGEc47AK) (Table 2). 

The results indicate that AlkK is an acyt-CoA syn- 
thetase, with a specificity for medium-chain-length fatty 
acids. This specificity is not unexpected since the other 
aflc-system enzymes are specific for medium-chain-length 
substrates as well. 

Primary structure ofAlkL 

AlkL shows 28% identity in a 1 70-amino-acid overlap with 
the Vibrio cholerae outer membrane protein OmpW (Jala- 
jakumari and Manning, 1990), and Ytt4 (Stoltzfus et a/., 
1988), a hypothetical 22.9 kDa protein in the trpA-tonB 
Intergenic region of £ coli (Fig. 7). The first 60 amino 
acids of AlkL show no homology with OmpW or Ytt4. The 
amino-terminal sequences of the three proteins strongly 
resemble bacterial signal sequences, while the remainder 
of the proteins is hydrophilic. AlkL has three closely 
spaced potential signal peptidase cleavage sites, of 
which the third, between amino acids 27 and 28, scores 
highest by the rules proposed by Von Heyne (1986). The 
calculated molecular mass of AlkL after cleavage of the 
putative signal peptide at this position is 22138 daltons, 
which is reasonably close to the apparent molecular mass 
of 20 kDa found in the minicell experiments. 

All three proteins end with a pattern characteristic of 
outer membrane proteins (Struyv6 era/., 1991). This pat- 
tern has characteristics suggesting that the carboxy-ter- 
minal 10 residues form a membrane-spanning amphi- 
pathic p-sheet. When the three sequences were analysed 
for turn-prone regions according to Paul and Rosenbusch 
(1985), it appeared that many of these regions are 
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Fig. 7. Comparison of AlkL with OmpW and Ytt4. 
The amino acid sequence of AlkL was aligned 
with the sequence of the V. choferaB outer mem- 
brane protein OmpW, and with the hypothetical E. 
co// protein Ytt4. Identical residues are indicated 
by (*). conserved residues are indicated by (.). 
Shaded regions represent turns predicted with the 
algorithm of Paul and Rosenbusch (1985). The 
numbers indicate putative transmembrane fV- 
sheet strands. The YXF motif characteristic of the 
carboxy terminus of the outer membrane© 
proteins is indicated In bold. 
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located at corresponding positions in AlkL, OmpW and 
Ytt4 (Fig. 7, shaded residues). As most outer membrane 
proteins probably have a p-sheet structure (Vogel and 
Jdhnig, 1986), this suggests a structure for AlkL in which 
the polypeptide traverses the membrane about 12 times, 
indicated by the numbers above regions separating the 
putative turns. 

Growth of Pseudomonas putida strains on ethylbenzene 

Fukuda et al (1989) have reported that P. oleovorans 
GPo1 is able to grow on ethylbenzene, and that at least 
the catalytic component of the alkane hydroxylase sys- 
tem; AlkB, is required for growth on ethylbenzene. It is 
possible that the alk system has evolved to allow its host 
to grow on compounds other than alkanes, e.g. ethylben- 
zene. We therefore tested whether AlkJ, AlkK and AlkL 
are required for growth on ethylbenzene. Apparently this 
is not the case; plasmid pGEc41 allows P. putida GPo12 
to grow as well on ethylbenzene as pGEc47. 

Expression of AlkJ, AlkK and AlkL in Pseudomonas 
putida and Escherichia coli 

To test whether AlkJ, AlkK and AlkL are expressed at sig- 
nificant levels, we fractionated P. putida and E coli cells 
carrying plasmids pGEc47 and its deletion derivatives 
(Fig. 8, panel A). Lanes 1 and 2 show total membranes of 
P. putida GPo12 carrying pGEc47. uninduced and 
induced, respectively. Two major and two minor dicycio- 



propylketone-inducible bands are indicated by arrows. 
The 40 kDa band represents AlkB. The second major 
band has a molecular mass of about 59 kDa. This size 
suggested that it represented AlkJ or AlkK. In lanes 3 t 4 
and 5 total membranes of GPo12 carrying pGEc47AJ, 
pGEc47AK and pGEc41, respectively, are shown. In 
lanes 3 and 5 the 59 kDa band is not present, which indi- 
cates that the band represents AlkJ. The AMerrninal 
sequence of the AlkJ band was determined, which 
resulted in the following sequence: Mfil-IyH-ys or Asp- 
Tyr- Val or Ug-]ls or Lys-YaKaly-Ala-Gly. The under- 
lined residues represent the first 10 amino acids of the 
translation of the alkJ ORF, confirming the identification 
of the 60 kDa band. 

The E coli equivalent of AlkK; FadD, is a membrane- 
bound protein, which suggests that AlkK is membrane 
bound as well. However, it is the cytoplasmic fraction of 
GPo12 (pGEc47) that contains a 60 kDa protein (Fig. 8, 
panel B, lane 2), absent when the alk genes are not 
induced (lane 1), and absent when part of the alkKqene 
is deleted (lane 3). 

The primary sequence of AlkL suggests that it is local- 
ized in the outer membrane. We studied the expression of 
AlkL in E. coli strains, because E coli membranes can 
easily be separated by sucrose-density centrifugation, 
which is not the case for P. putida membranes. E coli 
W31 1 0 was used because it expresses the Alk proteins to 
high levels (M. Nieboer and B. WHholt, unpublished). 
Panel C of Fig. 8 shows that in outer membranes of 
W31 10 (pGEc47) cells, induced with dicyclopropylketone, 
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Fig. 8. Expression of AlkJ, AlkK and AlkL in P. putida and £ coll. P. putfda GPo12 and E. coll W31 10 strains were grown in E-2 medium and induced with 
DCPK for expression of the aft genes where specified, as described In the Experimental procedures. Cell fractions were isolated and analysed for the 
presence of AlkJ, AlkK and AlkL Arrows mark olcyclopropylketone-inducible proteins. 

A. Total membranes of P. putfda QPo12 strains, Incubated for 30 min at 37°C with 1 volume of denaturatlon buffer, and separated on a 12.5% potyacry- 
lamide gel. Lane 1. GPo12 (pGEc47; lane 2, GPo12 (pGEc47) Induced; lane 3, GPo12 (pGEc47oJ) Induced; lane 4, GPo12 (pGEc47AK) induced: lane 5, 
GPo12 (pGEc41) Induced. 

B. Cytoplasmic fractions of P. putida GPol2 strains, incubated for 5 min at 100°C, and separated on a 7.5% polyacrylamide gel. Lane 1 , GPo1 2 
(pGEc47); lane 2, GPo12 (pGEc47) Induced; lane 3, GPo12 (pGEc47AK) induced. 

C. Outer membranes of E. coti W31 10 strains, separated on a 12.5% gel. Samples shown in lanes 1, 2 and 3 were boiled, samples shown in lanes 4 and 
5 were Incubated at 37°C. Una 1, W31 10 (pGEc47); lane 2, W31 10 (pGEc47) Induced; lane 3, W3110 (pGEc47AL) induced; lane 4, W31 10 (pGEc47); 
lane 5. W31 10 (pGEc47) induced. 



a 25 kDa band is present (lane 2), which is absent in outer 
membranes of W3110 (pGEc47AL) (lane 1), and unin- 
duced W3110 (pGEc47) (lane 3). This strongly suggests 
that the band represents AlkL. 

A 21 kDa dicyclopropylketone-inducible band is also 
visible in lanes 2 (pGEc47), 3 (pGEc47AJ) and 4 
(pGEc47AK) of Fig. 8., Panel A. In all three cases the alkL 
gene is present. In lane 5 (pGEc41 :AJKL) the band is 
absent, as would be expected if it represents AlkL. The 
size difference can, In part, be explained by the sample 
preparation. Total membranes of GPo12 are normally 
mixed 1:1 with denaturing buffer, and incubated at 37°C 
for 30 min, while E. coli outer-membrane preparations are 
usually boiled. When W31 10 (pGEc47) outer membrane 
samples were treated at 37°C, the AlkL band shifted to 20 
kDa (Panel C, lanes 4 and 5, uninduced and induced, 
respectively). Many other outer membrane proteins show 
the same heat-modifiable behaviour, as has been noted 
previously.(Wensink and Witholt, 1981). 



Discussion 

In this paper we present the nucleotide sequence of the 
distal three genes of the atkBFGHJKL operon. We show 
that the alkJ gene encodes the NAD(P)-independent 
alcohol dehydrogenase studied by Benson and Shapiro 
(1976), alkK encodes an acyl-CoA synthetase and alkL 
encodes an outer membrane protein of unknown function. 
The results confirm the alcohol dehydrogenase marker- 
rescue experiments of Owen et al. (1984), and the pep- 
tide mapping by Eggink et al. (1987a). The work pre- 
sented here, and that published before is summarized in 
Fig. 9. 



Product of the alkJ cistron 

The translation product of the alkJ cistron has previously 
been identified in E. coli minicell expression experiments 
(Eggink et al., 1987a). The size and location of the first 
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Fig. 9. Genetic structure of the P. oteovorans alk regulon and functions of the encoded proteins. The location of the aik open reading frames (open bars), 
the apparent molecular masses ( xtO -3 ) of the Alk proteins determined In minicell expression experiments (Eggink et at., 1987a). and their functions are 
shown. 
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ORF in the DNA sequence presented in Fig. 2 corre- 
sponds perfectly to that determined in these earlier exper- 
iments (Table I). The AMerminal sequence, as deter- 
mined by gas-phase sequencing, was identical to that 
predicted from the DNA sequence. 

The AlkJ amino add sequence shares clear homology 
with a number of enzymes: choline dehydrogenase, glu- 
cose dehydrogenase,_glucose oxidase and alcohol oxi- 
dase (Fig. 3), the last three of which have been shown to 
be flavin enzymes. AlkJ is NAD(P)-independent, and pos- 
sesses a characteristic ADP-binding fingerprint close to 
the amino terminus, like the homologous enzymes 
(Fig. 5). This suggests that AlkJ binds FAD as well. 

We could not detect any H 2 0 2 production by strains 
carrying the afkJ gene. The possibility that AlkJ, like the 
choline dehydrogenase (BetA) is an electron transfer- 
dependent dehydrogenase was tested. The results show 
that oxygen is consumed when 1-octanol is added to 
membrane preparations containing AlkJ. The same 
preparations contain the PMS/DCPIP oxidizing activity 
studied by Benson and Shapiro (1976). Both activities are 
lost when a part of the aikJ gene is deleted. 

The alignment of AlkJ in "Fig. 3. shows that the dehydro- 
genases are related to the oxidases, in this respect it is 
interesting that the two oxidases have no access to the 
electron-transport chain. Yeast alcohol oxidase is located 
in the yeast peroxisome, while glucose oxidase is 
exported to the medium. Drosophila glucose dehydroge- 
nase is exported into the ejaculatory ducts and has no 
access to the electron-transfer chain either. The in vivo 
electron acceptor of this enzyme is unknown (Cavener 
and Maclntyre, 1983). 

The properties of AikJ and BetA are similar to those of 
the dye-linked alcohol dehydrogenase of P. aenjginosa 
described by Tassin et al (1973). The P. aenjginosa 
enzyme was later shown to be a periplasmic pyrrolo- 
quinoline quinone-containfng enzyme (Groen et at., 
1984), linked to the electron-transfer chain through a spe- 
cial-purpose newly characterized cytochrome (J. A. 
Duine, personal communication). AlkJ and BetA may use 
coenzyme Q as their entry point for the electron-transfer 
chain, similar to the succinate, fumarate, lactate and 
NADH dehydrogenases, which are pan! of central 
metabolic pathways, contain FAD and can be assayed 
with PMS/DCPIP. 

Product of the alkK cistron 

No functions had previously been assigned to, or pro- 
posed for, the distal two cistrons of the operon because 
no mutations affecting alkane catabolism had been 
mapped to these genes. In addition, the alkane hydroxy- 
lase system, encoded by alkB, aMcGand alkT, the alcohol 

dehydrogenase AIKJ and the aldehyde dehydrogenase 



AlkH together already constitute a complete pathway 
from alkanes to fatty acids. 

The homology of AlkK to several acyl-CoA synthetases 
suggested that AlkK might catalyse the first step in the 
fatty add p-oxidation pathway to form an alkanoyl-CoA 
intermediate. This notion was confirmed by complemen- 
tation of an E coti acyl-CoA synthetase (fadD) mutation w 
by plasmids containing the alkK gene. E coii K27 fadR 
carrying the alk system grows equally well on octane 
when compared with E co//GEc137 carrying the alk sys- 
tem, which shows that alkK can fully complement the 
function of fadD. As AlkK only complements the fadD 
mutation for medium chain-length substrates we con- 
clude that AlkK is specific for these substrates. The 
observation that AlkK is a soluble protein, unlike its E coii 
equivalent FadD, also suggests that its substrates must 
be relatively soluble. 



Product of the alkL cistron 

The last ORF of the alkBFGHJKL operon (alkL) encodes 
a 25.0 kDa polypeptide, with a signal peptide-like 
sequence. This may be cleaved off to form the mature 
alkL gene product, homologous to a V. cholerae outer- 
membrane protein, OmpW. Several features of the pri- 
mary structure suggest that AlkL is an outer membrane 
protein which could cross the membrane about 12 times 
in an amphipathic (3-sheet structure. Outer membranes of 
W31 10 carrying pGEc47 contain an alkane-inducible pro- 
tein with a molecular mass of 25 kDa. When the alkL gene 
is not induced, or deleted, this protein band is not present. 



Function of AlkJ, AlkK in alkane catabolism 

AlkJ and AlkK were identified as alcohol dehydrogenase 
and acyl-CoA synthetase, respectively. AlkJ is essential 
for growth on alkanes only in P. putida alcohol dehydro- 
genase mutants, while AlkK is only essential for growth 
on alkanes when the corresponding chromosomal func- 
tion is missing, e.g. in the E coii fadD mutant K27. We do 
not know whether only the first step of fatty acid degrada- 
tion is encoded by the a/fcoperons. It is quite possible that 
other non-essential p-oxidation genes are located outside 
the regions that were cloned from the OCT-plasmid. 

As AlkL is located in the outer membrane, it could be 
involved In transport of substrates. Many compounds that 
are oxidized by the alkane hydroxylase system in vitro 
(McKenna and Coon, 1970), are not converted by the 
enzyme system in vivo (Bosetti et ai. f 1992; J. B. Van 
Beilen, unpublished results). Compounds that are con- 
verted in vivo may utilize a transport system to enter the 
cell. Experiments to test whether AlkL has a transport 
function have thus far yielded contradictory results (not 
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shown). However, in these experiments the alkane sub- 
strates, were present at 2% of the total culture volume. 
AikL may be necessary only at (very) low substrate con- 
centrations. An Indication that this is the case is the low 
Km (less than 5 of the alkane hydroxylase for r> 
octane. Alternatively, it is possible that the function of 
AlkL is specific for the original host (probably a Gram- 
negative bacterium because AlkL is an outer membrane 
protein), or for the original substrate of the aik proteins, 
which may not be a medium-chain-length alkane. The 
observation that the alkJKL genes are not necessary for 
growth on ethylbenzene suggests that this 'native* sub- 
strate may again be a different compound. 

Evolutionary aspects of alkane oxidation 

We have shown that the P. oleovorans a/Ac system 
encodes several functions (AlkH, AlkJ and AlkK) that 
have a chromosomal equivalents (Kok et a/., 1989b), and 
two proteins the functions of which are unknown (AlkF 
and AlkL). The codon usage and G+C content of the alk 
system differ significantly from the rest of the OCT-plas- 
mid and the P. oleovorans chromosome, suggesting a 
transfer of the alk system to P. oleovorans and the OCT- 
plasmid. The fact that the ORFs of the non-essential Alk 
polypeptides mentioned above are still intact indicates 
that this must have been a relatively recent event. 

Experimental procedures 

Bacterial strains, plasmids and growth conditions 

The bacterial strains and plasmids used are listed in Table 2, 
Cells were grown either on tryptone-yeast extract (TYE) 
medium or on E-2 medium (Eggink et a/., 1987b), supple- 
mented with carbon sources, thiamine or antibiotics. For 
growth on n-octane or 1-octanol, the Petri dishes were incu- 
bated at 32°C in a sealed container under noctane or 1- 
octanol vapour. Ampicillin was used at 50 mg r\ tetracycline 
was used at 12.5 mgr 1 . 

Mobilization of pLAFRI derivatives from E. coti to Pseu- 
domonas was done according to Figurski and Helinski (1979), 
by the replica plating technique using the helper plasmid 
PRK2013 (Ditta et a/., 1980). E. coli strains were cured of 
pGEc47 or its derivatives by growing cells in the absence of 
tetracycline while inducing the alk system with dicyclopropylke- 
tone. 



Table 2. List of strains and plasmids, and relevant genotype and/or 
phenotype. 
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Heisvani genotype 


Growth on 






or piasmio 


octane 


Cai i#aa a §f DaIaiaaaa 

oource or neierence 


c .Mil 








OCOftft 

oroOU 


Vlf] I < A _ _ / A 
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Laboratory collection 


W3110 


K , tnyA36 t aeoC2, IN 1 




P 1 — / 4 AA'^ri 

Bacnmann (1987) 


GEC137 


Dm, fear? 




Eggink at a!. (1987b) 


GEc93 


GEc137, pGEc47 




Eggink e/a/. (1987b) 


GEc139 


GEc137, pGEc41 


++ 


Eggink era/. (1987b) 


GEc350 


GEC137, pGEc47Aj 


♦4 


This study 


GEC358 


GEc137, pGEo47AK 




This study 


GEC359 


GEc137 t pGEc47AL 




This study 


K27 


K12Ymel, faoX> 




Overath at at. (1869) 


GEc351 


K27, pGEc47 




This study 


GEC354 


K27, rao/? 




This study 


Obc355 


GLC354, pG Ec47 


♦+♦4 


This study 


GcC356 


Gtc354, pGEc41 




This study 


ucCOOr 


ucC354, puc0476J 


++ 


This study 


U t Co oU 


UtCob4, ptibC47AK 




This study 


ucCooi 


W3110, pGtc47 




This study 


Gcg3o2 


W31 1 0, pGEc47AL 




This study 


r. ffutiaa 








GP01 


Prototroph (OCT-1 ) 


++++ 


Schwartz and 








McCoy (1973) 


Gro12 


GPo1 cured of OCT 




Kok (1988) 


GPp202 


GPo12, pGEc47 




Eggink ef a/. (1987b) 


GPpZ03 


GPo12 w pGEc41 


++ 


Eggink e/af. (1987b) 


GPpZ04 


GPo12, pGEc47/VJ 


+♦ 


This study 


GPp205 


GPo12, pGEc47AK 




This study 


GPp206 


GPo12,pGEc47AL 


+++ 


This study 
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PpS597pGEc47 


+4+ 


This study 
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PpS597pGEc41 
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TkU M 4, uJ., 

inis study 


GPp210 
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w- 


1 nts stuoy 


Plasmids 








PRK2013 
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Ditta ef a/. (1980) 


pGEc51 


pBR322, atkHJK'Hin&m insert 


Eggink etal. (1987a) 


DGEC47 


pLAFRI, afkST/alHBFGHJKL 


t j _ . . a An*TI- V 
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pGEc41 


pLAFR 1 , alkS T/alkBFGH 


Eggink etal (1987b) 


pGEc51AJ 


pGEc51, deletion in afkJ 




This study 


pUCl8JLAK pUCl8, see text and pGEc47AK 


This study 


pUCtSJKAL pUClB, see text and pGEc47AL 


This study 


pGEc47AJ 


pGEc47, 774 bp PsR deletion 


This study 




\nalkj 






pGEc47AK pGEc47 t 568 bp H/ndlll deletion 


This study 




In alkK 






pGEc47AL 


pGEc47, deletion of alkL from 


This study 




Sa/I(8000) to BanH\ (-9900) 





DNA manipulations 

Plasmid DNA was extracted according to Birnboim and Doly 
(1979). DNA fragments were isolated as described by Zhu et 
al. (1985). Restriction enzymes and T4 DNA tigase were pur- 
chased from Boehringer Mannheim and used according to the 
specifications of the supplier. E. constrains were made compe- 
tent for DNA transformations according to Chung and Milter 



(1986). Nucleotide sequencing was done according to Sanger 
etal. (1977) using M13mp18 and M13mp19 as vectors to pre- 
pare single-stranded DNA. The sequence strategy (Fig. 1 E) 
was a combination of primer walking, shotgun cloning, and 
forced cloning. 

The nucleotide and amino acid sequences were analysed 
using programs collected in the PC/gene software package, 
developed by A. Bairoch (Genofit/lntelligenetics), the NAQ and 
psq database search programs from the Protein Identification 
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Resource (Dayhoff et a/., 1983) on the VAX1 1/750 computer, 
the Swiss-Prot Protein database (EMBL, Heidelberg, release 
18), and the fastp sequence alignment program (Lipman and 
Pearson, 1985). Multiple alignments and evolutionary trees 
were generated by the method of Higgins and Sharp (1988). 

N-terminal sequence determination 

Proteins separated by SDS-potyacrytamlde gelelectrophoresis 
were transferred to polyvlnylidene difluoride (PVDF) mem- 
branes (Immobilon Transfer, Millipore) according to Matsudaira 
(1987). The protein bands were stained with Coomassie bril- 
liant blue and cut from the membrane. The AMerminal amino 
acid sequence was determined by Eurosequence B.V., Gronin- 
gen, The Netherlands with a Model 477A, Applied Biosystems 
gas-phase protein sequenator. 

Construction of deletion derivatives ofpGEc47 

Plasmids pGEc47AJ, pGEc47AK, pGEc47AL are derivatives of 
pGEc47 in which a 747 bp Psfl fragment was deleted from the 
a/fcjgene, a 586 bp H/ndlll fragment was deleted from the afkK 
gene, and a -1900 bp Sal\-BamH\ fragment containing the 
alkL gene was deleted, respectively. All three deletions were 
first created in pMB 1 plasmids- carrying only (part of) the afkJKL 
region. The deletions were then transferred to pGEc47 by 
marker rescue as described below. 

To construct pGEc47AJ a Psfl fragment (4583-5357) was 
deleted from pGEc51 (Eggink et al., 1987a), which contains a 
H/ndlll fragment (2753-7337) with alkH, alkJ, and part of alkK. 
For alkK the Psfl(5357)-H/ndlll(7337) and H/ndlll(7923)- 
£coRI(«10500) fragments were combined with pUC18 cut with 
Psfl and EcoRI. This resulted in a plasmid which contains the 
entire alkJKL region, except for part ol alkK. To construct an alkL 
deletion, first the Psfl(5357)-SaA(8rjOO) fragment was cloned in 
pUC18, and subsequently the BarnM\(~S900)-Eccft\{'» 10500) 
fragment was added, resulting in a construct which lacks alkL 
but contains the flanking DNA (a simple Sari deletion was not 
possible because of the presence of a previously unreported 
Sah site close to the EcoRI site). 

E. coli SF800 (polA-) containing pGEc47 was transformed 
with the three deletion plasmids, pGEc51 AJ, pUC18JLAK, and 
pUCl 8JKAL. Because these plasmids are pMB1 plasmids they 
cannot replicate in SF800. Ampicillin-resistant. (Ap R ) colonies 
can only be obtained by homologous recombination of these 
plasmids into pGEc47. A subsequent selection for loss of the 
Ap marker, again by homologous recombination, effectively 
resulted in transfer of the deletions to pGEc47 in about 50% of 
the ampicillin-sensitive (Ap s ) colonies. 



Alcohol dehydrogenase assay 

E. coli and P. putida recombinants were grown overnight in 50 
ml LB medium. These cultures were used to inoculate 500 ml 
LB medium. After 1 h of incubation at 30°C, the alk genes were 
Induced by the addition of dicyclopropylketone to 0.02%. The 
cultures were harvested 3 h after induction. Total membrane 
preparations for alcohol dehydrogenase assays were prepared 
as described below. Protein concentrations were determined 



according to Marfcweti et al. (1981). The alcohol dehydroge- 
nase assay based on the reduction of DCPIP in the presence 
of PMS as described previously (Tassin et a/., 1973). "fte 
assay system contained 2.7 ml of 50 mM Tris-HCI (pH 7.5), 1 
mM KCN. 2.2 mM 1-octanol and 0.2 mM DCPIP. This solution 
was allowed to equilibrate for 5 min at 30°C before 0.1 ml of a 2 
mg mr 1 PMS solution was added. After a blank rate was 
recorded appropriate quantities of membranes (10-50 ul, 
0.1-1 .0 mg of protein) were added to initiate the reaction. The 
activity was assayed by measuring the decrease in 
absorbance of DCPIP at 600 nm. Activity is expressed as 
mlcromoles of DCPIP reduced per minute per milligram of pro- 
tein. DCPIP has an extinction coefficient of 20.6 mM" 1 cm" 1 at 
600 nm. 

Oxygen consumption by membrane preparations was mea- 
sured using a Biological Oxygen Monitor. To a 5 ml volume of 
50 mM Tris-HCI buffer pH 7.4, saturated with oxygen, 0.5 ml of 
membrane suspension was added. This mixture was allowed 
to equilibrate for 2 min. 1 -Octanol was added to 2 mM as a 1 M 
solution in ethanol. KCN was added as a 1 M solution in 50 mM 
Tris-HCI buffer, pH 8.0. PMS was added as a 2 mg ml -1 solu- 
tion in 50 mM Tris-HCI buffer pH 7.4. To test NADH oxidase 
activity NADH was added to 100 pg mr 1 . Triton.X-100 was 
used at a concentration of 1 %. 



Alcohol oxidase assay 

Alcohol oxidase activity was assayed according to Verduyn et 
al. (1984). The chromogenic substrate ABTS is converted into 
a green product by peroxidase and hydrogen peroxide. The 
assays were carried out in 50 mM K-phosphate pH 7.5, 1 mM 
Naazide. Azide was added to prevent interference from cata- 
lase activity. The AlkJ and peroxidase activities were not 
affected by the addition of azide, as tested for AlkJ in the Bio- 
logical Oxygen Monitor, and for peroxidase by measuring the 
disappearance of H 2 0 2 at 240 nm. Calculations were based on 
an extinction coefficient of the green product of 43.2 mM'^m -1 
at412nm. 



Isolation of membrane fractions 

P. putida total membranes were isolated essentially as 
described by Van Heerikhuizen et al. (1975). 2 ml L-broth 
precultures were inoculated from fresh plates and grown during 
daytime. Erlenmeyer flasks containing 50 ml of E2-medium 
with 0.5 % glucose were inoculated with 2 ml of the preculture. 
After 2 h the cultures were induced with 10 ul dicyclopropylke- 
tone. Cells were harvested after overnight incubation at 30°C, 
and converted to sphaeroplasts. These were disrupted by pas- 
sage through a French Press cell at 69000 kPa. After removal 
of large fragments and whole cells, total membranes were pel- 
leted. E. coli membranes were separated on a sucrose-density 
gradient. Samples were analysed on 7.5 or 12.5 % SDS-poly- 
acrylamide gels (Laemmli, 1970). 
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Plasmodium is unable to carry out de novo fatty acid synthesis and has to 
obtain these compounds from their host for subsequent activation by 
thioesteriftcation with coenzyme A. This activity is catalyzed by a fatty 
acyl-CoA synthetase enzyme (EC 6.2.1.3). Here, we describe a novel gene 
from P. falciparum whose recombinant purified product from baculovirus- 
transfected insect cell line had the enzymatic activity of a long-chain fatty 
acyl-CoA synthetase. It was named p/acsl, since it belongs to a multi- 
member gene family as revealed by the sequence of several clones and a 
multi-band pattern in Southern blots. The sequence specifies a product of 
820 amino acid residues. It was transcribed and expressed in infected 
erythrocytes having an apparent molecular mass of 100 kDa. Immuno- 
labeling of infected erythrocytes with a specific antibody against the 
carboxy-terminal part of the PfACSl localized the product early after the 
erythrocyte invasion in vesicle-like structures budding off the parasitofor- 
ous membrane toward the red cell cytoplasm. Its unique carboxy- 
terminal structure of 70 extra amino acid residues, longer than any other 
reported acyl-CoA synthetase, is probably related to its localization in the 
cytoplasm of the host erythrocyte. The phylogenetic relationship among 
other AMP-forming enzymes, placed PfACSl closer to Saccharomyces 
cerevisiae, sharing significant amino acid identities, especially in the 
conserved signature motif that modulates fatty acid substrate specificity 
and ATP/AMP-binding domains. Taking into account the importance of 
this enzymatic activity for the parasite, its extra-cellular location inside 
the infected erythrocyte, and the divergence with respect to the homolo- 
gous human enzymes, it may be an important protein as a potential 
target candidate for chemotherapeutic antimalaria drugs. 

© 1999 Academic Press 

Keywords: Plasmodium falciparum-'mfected erythrocyte; acyl-CoA 
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Introduction 

Malaria remains one of the most important 
diseases of humans in terms of both mortality and 
morbidity, with Plasmodium falciparum being the 
most important infecting agent, killing over one 
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million children each year (Sturchler, 1989). Con- 
tinual exposure of malarial parasite populations to 
different drugs may have selected for resistance to 
individual drugs and for genetic traits that favor 
initiation of resistance to novel unrelated antima- 
larials (Rathod et al, 1997). Given this situation, 
there is a desperate need for new and better thera- 
peutic drugs and targets. 

During part of its life-cycle, the malaria parasite 
lives inside vertebrate erythrocytes. Although 
during its intraerythrocytic development Plasmo- 
dium spp causes a considerable increase in total 
fatty acid content of the host cell (Simoes et al, 
1992; Beaumelle & Vial, 1988), at this stage neither 
the parasite nor the host erythrocyte possesses bio- 
chemical pathways for the synthesis of fatty acids 
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(Holz, 1977; Haldar et al, 1985). To fulfil its nutri- 
tional requirements and construct a complex series 
of membranous tubules and vesicles throughout 
the erythrocyte, the parasite is able to actively take 
up fatty acids, either in free form from the blood 
plasma or as lysophospholipids from the host 
erythrocyte membrane, and to incorporate these 
into complex molecules, such as diacylphospholi- 
pids (Lauer et al, 1997; Glick & Rothman, 1987). 
The first step for the incorporation of fatty acids is 
the thioesterification with coenzyme A (CoA), an 
enzyme activity that is known to be increased 
20-fold in Plasmodium knowlesi-mfected cells 
(Beaumelle & Vial, 1988). In mammals, fatty acid 
utilization is initiated after their activation 
catalyzed by acyl-CoA synthetases (ACS, fatty 
acid:CoA ligase, AMP-forming; EC 6.2.1.3). The 
acyl-CoA, produced from fatty acid, ATP, and 
CoA by ACS, is a key intermediate in various 
metabolic pathways including protein transport, 
enzyme activation, protein acylation, cell signaling, 
and transcriptional control, in addition to serving 
as substrates for P-oxidation and phospholipid bio- 
synthesis (McLaughlin & Aderem, 1995; Korchak 
et al, 1994). 

In this work, we have isolated and characterized 
the first acyl-CoA synthetase from P. falciparum 
(PfACSl). It belongs to a multigene-member family 
with different isotypes. PfACSl may be a candi- 
date target enzyme for a chemotherapeutic 
approach, since it is essential for the parasite devel- 
opment, and there are large differences in the 
amino acid sequences between P. falciaparum 
enzymes and those reported for humans. 

Results 

Isolation and characteristics of the P/acs1 gene 

Our experiments were designed to complete the 
coding sequence of a P. falciparum antigen-encod- 
ing clone (clone 15) that showed significant hom- 
ology to acyl-CoA synthetases from other 
organisms, and it was named PfACSl in view of 
the existence of several homologous genes in 
the parasite genome (Figures 1 and 2). Toward 
this end, we isolated two overlapping A,gtl0 
P. falciparum clones by screening a library first with 
the clone 15 probe (probe-c) obtaining the clone 34, 
and secondly, with a probe from the 5' end of 
clone 34 (probe-a) resulting in the close 27. Two 
primers (IP19 and IP20) at the 3' and 5'-ends of 
the presumed coding sequence were used to PCR 
amplify the complete insert from P. falciparum 
genomic DNA in three independent PCR reactions. 
Eight IP19-IP20 fragments were subcloned and 
sequenced. Some punctual nucleotide changes 
found in only one of these clones were considered 
PCR artifacts and the consensus sequence is rep- 
resented in Figure 1. The longest open reading 
frame (ORF) of the P/acsl sequence was 2460 bp, 
potentially encoding a protein of 820 amino acid 
residues with a calculated molecular mass of 
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■ ESS PfACSl 

ss ACS-like Led 

Figure 1. Schemes of general characteristics of the 
deduced amino acid sequence of the P/acsl gene. The 
hydropathy plot of the product according to Hopp & 
Woods (1981) is shown in which positive index 
represents hydrophilicity and negative index hydropho- 
bicity. The scheme of the PfACSl (P. falciparum acyl 
CoA synthetase-1), and the presumed relevant parts of 
the amino acid sequence are shown: ss (black box), 
stands for signal sequence; ACS-like (white box), stands 
for acyl-CoA synthetase-like sequence; and Led (striped 
box), signifies lysine-enriched domain. The GenBank 
accession number of p/acsl is AF007828. 



94,774 Da. The deduced amino acid sequence has 
an unusually high content of basic amino acids (15 
Arg, 89 Lys, and 12 His) especially in the 70 
carboxy-terminal residues, where the lysine content 
is 22 % (Lys-enriched domain, Led). Consequently, 
it has a deduced isoelectric point (pi) of 9.67. The 
N terminus (residues 1-20) has the characteristic 
features of a signal sequence (ss) with a hydro- 
phobic core between residues 3 and 15, which is an 
essential part required for targeting and membrane 
insertion (Martoglio & Dobberrstein, 1998). The 
Led contains potential motifs for protein kinase C 
phosphorylation, N-myristoylation and N-glycosy- 
lation. 

P/acs1 homologous genes, transcription and 
expression of the product in P. falciparum 

In order to determine the copy number of the 
P/acsl gene, Southern blot experiments were per- 
formed using the radiolabeled probes from the 
5'-region (probe-a) and 3'-region (probe-b) of the 
P/acsl coding sequence that do not contain clea- 
vage sites for any of the restriction enzymes used 
(Figure 2). Digestion of the DNA with EcoRI plus 
C/al was clearly indicative of a multiple band 
pattern. Hybridization with probe-b revealed the 
existence of at least four homologous gene 
sequences. The probe-a reacted with only two of 
these four homologous sequences (Figure 2(a)). In 
spite of these apparent 5'-end polymorphisms, all 
of these homologous sequences contained two Ndel 
sites at approximately the same distance, giving 
rise to a 2.1 kb fragment (possibly containing two 
close bands in Figure 2(b), lanes 3 and 5). There- 
fore, these data indicate the existence of a multi- 
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Figure 2. Southern, Northern and 
Western blot analysis of pfacsl. 
(a) and (b) show Southerns of 
P. falciparum genomic DNA probed 
with radiolabeled probe-a (IP15- 
IP18, 151 bp) and probe-b (1389- 
IP8, 434 bp) (see bottom scheme) as 
explained in Materials and 
Methods. DNA was digested with 
EcoRI (lane 1), CM (lane 2), Ndel 
(lane 3), EcoRI + Clai (lane 4), 
EcoRI + Ndel (lane 5), HmdIII (lane 
6), BamHl (lane 7), and BamHI- 
+ Hmdm (lane 8). (c) Northern blot 
of total RNA from P. falciparum- 
infected erythrocytes probed with 
DIG-labeled prober (1165-IP8, 
715 bp) (see bottom scheme), 
(d) Western blots of extracts 
from saponin- treated P. falciparum- 
infected erythrocytes (iE) and sapo- 

nin-treated uninfected erythrocytes (E), separated in SDS-10% PAGE, and incubated with the specific anti-15 
antibody at 1/1000 dilution. Control Western blots with pre-immune serum, anti-TrpE antibody, and secondary anti- 
body alone were negative. The exact position of the probes are indicated in Materials and Methods. 
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gene family comprising at least four members 
homologous but not identical with the P/acsl gene. 

Analysis of total RNA from P. falciparum-miected 
erythrocytes by hybridization of blots with the 
3'-region probe-c revealed a single transcript of 
about 3.5 kb (Figure 2(c)). The transcription of 
the P/acsl gene was confirmed by RT-PCR 
and sequencing (not shown) of total parasite 
RNA. Therefore, the P/acsl gene is transcribed in 
P. /a/c/pflrum-infected erythrocytes. 

To confirm the expression at the protein level 
and to determine the size of the PfACSl gene 
product, Western blots of saponin-treated 
P. /fl/riparum-infected erythrocyte extracts were 
performed (Figure 2(d)) using the anti-15 antibody 
(specific for the carboxy-terminal half of the 
PfACSl protein). This antibody reacted with a 
major protein band with an apparent molecular 
mass of 100 kDa as determined by SDS-PAGE and 
a minor band with a molecular mass of around 
75 kDa. 



Comparison of PfACSl with the GenBank 
data sequence 

Searches of protein databases revealed signifi- 
cant similarities of PfACSl with long-chain fatty 
acid CoA synthetases from bacterial to mammals 
(EC 6.2.1.3.). The homology search revealed that 
the PfACSl sequence is more closely related to 
the acyl-CoA synthetase of yeast Saccharomyces 
cerevisiae than with other higher or lower eukar- 
yotes and prokaryotes. PfACSl seems to be closer 
to the yeast Faalp, Faa3p and Faa4p acyl-CoA 
synthetases than Faa2p (Johnson et al., 1994; Knoll 
et aL, 1994) (Figure 3). Sequence alignments indi- 
cated that the PfACSl protein has 20-23 % identity 
and 32-34% similarity with human, yeast rat, 



mouse, Brassica napus or Haemophilus influenzae 
fatty acid acyl-CoA synthetases (not shown). 
Further analysis of these data indicated that several 
regions of these enzymes were highly conserved 
(Figure 4). Several dispersed regions within the 
PfACSl sequence seem very similar to others 
found in adenylate-fonrting enzymes containing an 
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Figure 3. Phylogenetic analysis of PfACSl and other 
members of the adenylate (AMP)-forming enzyme 
family. The tree was constructed by the GROWTREE 
and the DISTANCE programs of the GCG package and 
corrected using the Juker-Cantor method. Different 
acyl-CoA synthetases (ACS), peptide synthetases (PS) 
(Surfactine), firefly (Photinus pyralis) luciferase (Luc), and 
coumarate ligases (Coum) are shown. 
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Figure 4. Alignment of the PfACSl and other fatty acid CoA synthetases. Brassica napus (Z72153), Human 2 
(P33121), Saccharomyces cerevisiae Faalp (S. c. Faalp) (P30624), S. c. Faa2p (P39518), and Haemophilus influenzae 
(P44446), P. falciparum acyl-CoA synthetase 1 (PfACSl) were aligned with PILEUP of a GCG/EMBNET facility. Iden- 
tical amino acids in three or more out of six sequences are on a black background, similar residues in three or more 
are on a gray background. Underlined with asterisks (*) is a proposed acyl-CoA synthetase signature motif (Black 
et a\. t 1987). The double underline at the C terminus of PfACSl is the Led domain, the P. falciparum sequence 
proposed to be involved in the extra-parasitic localization of the protein. 



AMP-binding site and the ATP-binding pocket 
within the fatty acyl-CoA synthetase (Conti et al, 
1996; Black et al, 1987; Johnson et al, 1994). A 
highly conserved 25 amino acid residue segment 
within this region is common to all fatty acyl-CoA 
synthetases (underlined with asterisks in Figure 4) 
(Black et al, 1987). These alignments revealed that 
the PfACSl has a striking longer carboxy-terminal 
region of 70 residues corresponding to the Lys- 
enriched domain (Led) of the parasite protein 
(double-underlined in Figure 4). Interestingly, an 
N-terminal hydrophobic 20 amino acid residue 
sequence, characteristic of membrane proteins, was 
not found in any of these acyl-CoA synthetases 
analyzed. As determined by SDS-PAGE, the size of 
PfACSl is greater (100 kDa) than that of mamma- 
lian acyl-CoA synthetases (ACS) (70-80 kDa). 



Expression and purification of PfACSl 

In order to determine the enzymatic activity of 
PfACSl, the whole coding sequence, including a 
six His tag at its C terminus, was produced and 
purified from baculo virus/ Sf9 insect cells. 
Attempts to express it in different Escherichia coli 
systems, including those lacking the Fad gene 
(Knoll et al, 1994), were unsuccessful. Toxicity of 
the protein and mutations introduced by the bac- 
teria accounted for this failure. Thus, Sf9 cells were 
transfected with recombinant PfACSl x 6His or a 
mock virus (non-recombinant) and its expression 
analyzed by SDS-PAGE and Western blots 
(Figure 5). Using the Triton X-100 soluble proteins 
of the transfected Sf9 cells as starting material, and 
chromatography on Ni 2+ -NTA resin, 
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Figure 5. Expression and purification of full-length 
PfACSl x 6His in Baculo virus /Sf9 insect cells. 
Pf ACS1 x 6His expressed in recombinant Sf9 cells were 
subjected to chromatography using Ni 2+ -NTA agarose 
under native conditions indicated in Materials and 
Methods. Visualization of protein extracts was 
performed by Coomassie blue stain of an SDS-7.5% 
PAGE and Western blots with the anti-15 antibody. M, 
Molecular mass markers; T, total Sf9 cell extract in the 
presence of 40 mM imidazole; U, flow-trough unbound 
extract; W, last-washed extract; E, fraction eluted with 
240 mM imidazole, pH 7.5. 



PfACSl x 6His was highly purified (>90 %) as 
shown by SDS-PAGE stained with Coomassie 
brilliant blue. This major protein of 95 kDa was 
specifically reactive with the anti-15 antibody in 
Western blots. Some minor proteins that co-puri- 
fied with the 100 kDa protein was also reactive 
with the anti-15 antibody. These minor proteins 
likely represent degradation products of 
PfACSl x 6His. 
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Figure 6. Acyl-CoA synthetase activity of purified 
recombinant PfACSl x 6His protein. Effects pf co- 
substrate on the [ 3 H]palmitoyl-CoA synthetase activity 
of PfACSl x 6His represented as percentage of activity 
with respect to the activity obtained in an optimal 
reaction mixture containing 5 mM ATP, 0.3 mM CoA, 
10 mM Mg 2 * and 0.2 \iM [ 3 H]palmitate (1 uCi) for 20 
minutes at 37 °C, pH 8. In dotted bars, the reaction 
mixture contained an extra 50 uM ATP. Optimal control 
reaction (100 % activity) = 65,890 cpm. Values are the 
mean of three assays. The graph shows the V max and K m 
of the purified PfACSl x 6His calculated from a Line- 
weaver-Burk plot with a substrate concentration ranging 
from 5 to 50 uM. 



The ACS activity of PfACSl x 6His was optimum 
at a pH of 8-9 (not shown), a rather basic pH com- 
pared with other ACS's organisms. 



Long-chain acyl-CoA synthetase activity of the 
recombinant PfACSl x6His 

The ACS activity of the purified recombinant 
enzyme was determined using [ 3 H]palmitate as a 
substrate and saturating concentrations of the rest 
of the co-substrates. As shown in Figure 6, not 
UTP, AMP, nor GTP at 5 mM may substitute the 
ATP for optimal activity. AMP, a product of the 
reaction in AMP-forming enzymes, had an inhibi- 
tory effect at concentrations of 5 mM in a reaction 
mixture containing ATP at 50 uM. These exper- 
iments indicated that the purified PfACSl x 6His 
was an acyl-CoA synthetase enzyme completely 
dependent on the presence of ATP, CoA, Mg 2 " 1 ", 
and fatty acid for activity. The PfACSl x 6His 
exhibited an apparent K m of 19(±3) uM and a 
specific activity (V^J of 40(±3) nmol/minute per 
mg of PfACSl protein at 37 °C calculated from a 
Lineweaver-Burk plot with a substrate concen- 
tration ranging from 5 to 50 uM (plot not shown). 



Immunolocalization of PfACSl in 
infected erythrocytes 

As shown in the immunofluorescences of 
Figure 7, asynchronous P. /a/ripflrwm-infected ery- 
throcyte cultures (~10% parasitemia), were posi- 
tively stained with the polyclonal antibody arose 
against the C-terminal part of the PfACSl protein 
(anti-15 antibody). Neither the pre-immune serum 
nor the second FITC-conjugated antibody bound to 
the cells (not shown). The IFA staining revealed 
the presence of vesicular structures in the erythro- 
cyte cytoplasm (Ec) as early as five hours post- 
invasion. The staining pattern of infected ery- 
throcytes evolved during the parasite life-cycle in 
the infected erythrocytes, which lasts for 46-48 
hours, likely as a consequence of the accumulation 
of vesicle structures bearing PfACSl between the 
parasite and the parasitophorous vacuolar mem- 
brane (pvm). Thus, when the pvm was stained 
with the antibody, at mid-late stage, many infected 
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Figure 7. Localization of PfACSl protein in 
P. falciparum-mfected erythrocytes by indirect FITC 
immunofluorescence microscopy, (a) Asynchronous 
culture of infected erythrocytes (iE) stained with anti-15 
antibody; (b) the same field as (a) using phase-contrast 
optics. At bottom left is shown a magnified image of the 
iE selected in (a), and processed with the Adobe Photo- 
shop program to increase contrast and densities and 
show in relief different parts of this trophozoite-infected 
erythrocyte, in which Ec stands for erythrocyte cyto- 
plasm; pv, parasitophorous vacuole; pvm, parasitophor- 
ous vacuolar membrane; and v, pv budding vesicles. At 
bottom right is the corresponding magnification of the 
same iE under phase contrast microscopy, in which the 
dark hemozoine crystal (he) is indicated with an arrow, 
he is a metabolic product of hemoglobin produced by 
mature infected erythrocytes (trophozoite and schizonts) 
in the digestive vacuole of the parasite. 



erythrocytes contained cytoplasmic vesicles (v) and 
membranous protrusions budding off the parasito- 
phorous vacuole (pv) Figure 7, iE, left panel). The 
presence of a black pigment of hemozoine crystals 
(he) in the parasite digestive vacuole is character- 
istic of mature tropozoite-stage parasite (Figure 7, 
iE, right panel). 

Immunoelectron microscopy of infected erythro- 
cytes labeled with the anti-15 antibody (Figure 8) 
was in agreement with the immunofluorescence 
pattern. Although the level of labeling was low, 
several kinds of vesicle-like structures were shown 
to contain consistently and specifically PfACSl epi- 
topes. All these structures were located in the cyto- 
plasm of the infected erythrocyte, outside the 
parasite. The antibody bound to small irregular 
structures of M).3 urn, in which the epitopes 
seemed to be on the surface of membranes (micro- 
graphs cl) or in linear disposition parallel the 
erythrocyte plasma membrane, but clearly in the 




Figure 8. Immunoelectron microscopy of P. falci- 
parum-mfected erythrocytes labeled with anti-15 anti- 
body and protein G-gold (particles of 10 nm diameter). 
Arrows in the photomicrogaphs indicate different struc- 
tures containing PfACSl protein located in the erythro- 
cyte cytoplasm (Ec) and outside the intracellular 
parasite (P) in all the cases observed. In structures (al) 
and (a3), the PfACSl appears to be located inside the 
vesicle or as possible protein aggregates; (a2) is a ves- 
icle-like structure that did not show labeling with the 
anti-15 antibody; (cl) shows apparent labeling on the 
surface of membrane structures of this vesicle; and (bl) 
is a linear string of labeling close of the erythrocyte 
membrane (Em). Pvm signifies parasitophorous vacuole 
membrane. All the photographs were taken with a Zeiss 
electronmicroscope at (a) 10,000 x and (b) and (c) 
50,000 x . The negative films were scanned-digitalized 
and the images processed with Adobe Photoshop to get 
the best contrast. 



inner place (micrograph B, bl). In addition, the 
antibody bound to an apparent distinct structure 
in which the reactivity was localized either in the 
interior of the vesicles or in protein aggregates. All 
of these structures may correspond with . small 
Maurer's clefts (100-600 nm diameter) or non- 
membrane bound aggregates in the host cytoplasm 
that may be active in the trafficking of proteins 
and membranes between the parasite and the host 
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plasma membrane (Barnwell, 1990; Gormley et al, 
1992). 



Stage-specificity of the Pf ACS1 expression 

In order to determine what IF pattern corre- 
sponded with what asexual stage (Figure 9), 
samples from highly synchronous cultures were 
prepared as before and the PfACSl expression ana- 
lyzed. After merozoite entry into the erythrocyte, 
the parasite develops a vacuole (v) and becomes a 
ring form (R), shaped like a signet ring, with the 
nucleus as the seal. This stage lasts ~18 hours. The 
rings grows by increasing its cytoplasm and feed- 
ing on the erythrocyte cytoplasm by ingesting 
hemoglobin, which is digested in food vacuoles. 
This stage persists for 18-30 hours. During this 
time, the parasite is the most metabolically active. 
During the following 16-18 hours, the schizont 
stage (S), the nucleus divides three to five times to 
form 8-32 nuclei. At this stage, the parasite 
occupies most of the erythrocyte and the pigment 
is concentrated in a single mass. In Figure 9, as 
early as five to ten hours post-invasion, at the early 
ring stage, PfACSl was detected at the erythrocyte 
cytoplasm in vesicle-like structures. At the mid-tro- 
phozoite stage (15-20 hours post-invasion) PfACSl 
was detected both at the parasite and spread across 
the host cytoplasm. From this time on (32 hours 
and 40 hours), the antibody stained the whole 
parasite in the limits of the pv and vesicle struc- 
tures at the erythrocyte cytoplasm (Ec). These 
observations suggest that the PfACSl protein is 
segregated at the space between the parasite and 
the pvm or the pvm before export into the erythro- 
cyte cytoplasm where it accumulates and makes it 
possible to be detected with the antibody. 



Discussion 

We have isolated and characterized a member of 
the long-chain fatty acid CoA synthetase family 
from P. falciparum-mfected erythrocytes as shown 
by the enzymatic activity of the recombinant pro- 
duct purified from bacuiovirus-transfected insect 
cell line. The enzymatic reaction of these type of 
enzymes is carried out by two-step mechanism 
that proceeds through the pyrophosphorolysis of 
ATP (Groot et al, 1976): (1) fatty acid + ATP Mg 2 * 
[fatty acid-AMP] + PPi; (2) [fatty acid-AMP]+ 
CoASH = Fatty acid-S-CoA + AMP. As shown in 
Figure 6, the enzymatic activity of the recombinant 
PfACSl was completely dependent on ATP, Mg 2 " 1 ", 
CoA and fatty acid, which confirms the activity of 
the purified protein and the suggestions of the 
homology sequence analysis. We have shown that 
the p/acsl gene is transcribed during the erythro- 
cyte stages, and the molecular mass of the 
denatured protein product (100 kDa) expressed in 
the parasite is in accordance with the calculated 
mass of the deduced amino acid sequence. 
Southern blot analysis revealed several homolo- 
gous genes in the parasite genome that hybridized 
with p/acsl gene probes. This situation would be 
more like what has been found in acyl-CoA 
synthetases from eukaryotes than in bacteria. Thus, 
for example, the S. cerevisiae acyl-CoA synthetase 
family comprises at least four members (Faalp to 
Faa4p) (Johnson et al, 1994), rat acyl-CoA synthe- 
tase is constituted so far by four members (Kang 
et al, 1997), while only one has been described 
in E. coli (fadD) (Black et al, 1987). These data 
have important biological consequences, since the 
variability of enzymes brings with it the possibility 
of different functional roles. In the yeast S. cerevi- 
siae, the Faalp is responsible for activation of 
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Figure 9. FITC immunofluores- 
cence of stage-specific expression of 
the PfACSl protein in synchronized 
P. /fl/c/parum-infected erythrocyte 
with the anti-15 antibody. Cultures 
were prepared ten hours after syn- 
chronization at the ring stage (R); 
after 20 hours at the ring- tropho- 
zoite stage (R-T); after 32 hours at 
the trophozoite stage (T); and after 
40 hours at the trophozoite-schi- 
zont stages (T-S). The bar in 32 h 
represents 10 \im. Immunofloures- 
cence with the anti-TrpE antibody 
and normal mouse sera as controls 
gave a negative staining (not 
shown). 
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imported fatty acids to their CoA derivatives, and 
Faa2p and Faa3p are able to utilize endogenous 
but not exogenously imported fatty acid substrates 
(Johnson et al, 1994). Rat acyl-CoA synthetase 4 
preferentially uses arachidonate and eicosapen- 
taenoate among C8-C22 saturated fatty acids and 
C14-C22 unsaturated fatty acids (Kang et al, 1997). 
In P. falciparum, with a complex life-cycle involving 
different stages in human and insect host cells, it is 
possible that there exists an expression regulation 
mechanism that may give rise to stage-specific 
expression as well as sub-cellular compartmentali- 
zation. 

The apparent K m value of the purified recombi- 
nant product (PfACSl x 6His) for palmitate 
(19 uM) was similar to that observed with enzymes 
from other sources, as for instance, the yeast 
S. cerevisiae Faalp x 6His (20 uM) (Knoll et al., 
1994). However, the V max of the purified product 
(40 nmol/ minute per mg) was 250 times higher 
than the V max of S. cerevisiae Faalp x 6His (Knoll 
et al, 1994). This may be a requirement for the 
very active lipid metabolism that needs to take 
place after erythrocyte infection for membrane 
biogenesis (Haldar et al, 1985; Haldar, 1992; Lauer 
et al, 1997). While S. cerevisiae can activate exogen- 
ous fatty acids as well as synthesize them de novo 
from acetate (Schweizer et al, 1978), Plasmodium 
relies only on the exogenous sources (Holz, 1977; 
Sherman & Greenan, 1984; Haldar et al, 1985; 
Beaumelle & Vial, 1998). 

The immuno-localization of PfACSl protein in 
the cytoplasm of the infected erythrocyte, in ves- 
icle-like structures (possibly small Maurer's clefts), 
together with the fact that the PfACSl protein 
remains in the pellet of inside-out-vesicle prep- 
arations (data not shown) indicates that it is a 
membrane-associated protein. This conclusion is 
further supported by the fact that saponin extracts 
of infected erythrocytes did not dissolve the 
PfACSl protein as assessed by Western blots 
(Figure 2). This saponin treatment has been shown 
to fail to separate host erythrocyte membrane from 
the parasite, regardless of its stage (Beaumelle et al, 
1987), suggesting that it could interact with host 
erythrocyte membrane proteins or with parasite 
saponin-insoluble proteins (Figure 2). A molecular 
aspect that further support the interaction of 
PfACSl with other proteins is the 70 amino acid 
residue region located at the C terminus of the 
PfACSl protein. This domain, positively charged, 
is not found in other acyl-CoA synthetases so 
far described, suggesting that it does not play a 
role in the enzymatic activity, but rather, it may 
be involved in extra-parasite interactions. The 
N- terminal hydrophobic 20 amino acid residue 
region showed characteristics of a signal sequence 
(Martoglio & Dobberstein, 1998), and because the 
cellular localization of PfACSl, it could be required 
for directing the protein from the parasite toward 
the erythrocyte cytoplasm. With regard to this, the 
PfACSl protein may be of great value to further 
clarify the processes and the interactions between 



host and parasite proteins at the molecular level 
and how the parasite proteins are transported to 
the erythrocyte membrane, processes that are not 
completely understood (Knapp et al, 1991; Wiser, 
1991; Gormley et al, 1992). 

Given the early expression of PfACSl in infected 
erythrocytes and the fact that the parasite, in its 
host erythrocyte, is unable to carry out fatty acid 
synthesis de novo (Holz, 1977; Sherman & Greenan, 
1984; Haldar et al, 1985; Beaumelle & Vial, 1998), it 
is reasonable to assume that this enzyme might 
play a role in the formation of the tubu vesicular 
membranes that extends from the parasite's 
vacuole membrane to the periphery of the red cell. 
This vesicular network constitutes a transport sys- 
tem that allows efficient access of nutrients to the 
parasite (Lauer et al, 1997) and it is essential for 
Plasmodium as intracellular parasite. Our data 
further support the importance of this PfACSl, 
since its putative role in the activation of fatty 
acids would take place at the right place where the 
enzyme and substrates are concentrated (fatty acid 
from host membrane and serum). From this cellu- 
lar localization, their fatty acyl-CoA derivatives 
may be used for phospholipids synthesis and 
incorporated into the newly synthetasing mem- 
branes and fatty acid metabolism. In mammals, 
this biochemical activity is blocked with Triacsins 
inhibitors (Tomoda et al, 1991) and with fatty acid 
analogues in P. falciparum (Beaumelle & Vial, 
1998). With a view to a possible chemotherapeutic 
approach, it is interesting that the localization of 
these kinds of enzymes outside the parasite, in the 
host erythrocyte cytoplasm, since a potential drug 
would have to pass through only one bilayer mem- 
brane to reach its target protein. The presence of 
similar enzymes in the human host could make it 
difficult to find specific inhibitors. However, the 
high degree of dissimilarity between human acyl- 
CoA synthetases and P. falciparum ACSs (PfACSl) 
suggests the possibility of finding specific inhibi- 
tory drugs with a good therapeutic index. 

Material and Methods 

Oligonucleotides 

Oligonucleotides were synthesized using an automatic 
DNA synthesizer (Applied Biosy stems model 381). The 
matching sequence at the Pfacsl gene as well as the 
restriction site (underlined) and other traits included in 
the primer are following indicated: 1165 (nucleotides 
1549-1565/GGAGGGCCTATTTGTCC); 1389 (1830-1846/ 
G A AT AATCTCT ATTCGC); 1393 (1339-1322/CAAGAA 
AATTAACTAACC); 1859 (1565-1 549 /GG AC AAATAGG 
CCCACCC); IP8 (2264-2248/CTAATAGGAGCACAT 
CC); IP15 (463-481 /AGTGGCGTTACTAC AT TAG); IP18 
(614-595 /TTCTTCAAGTGTGGC A ATTC); IP19 with an 
EcoRI site (1-20/GGAATTCTAATGGTATCATTTTAAA- 
TAT); IP20 with an EcoRI site (2463-2442 /G G A ATTC T- 
CATTTATTCATTTCTTTTTGA); IP22 (904-923/GA 
GGATCCTAA TTTGTTAC); IP23 (923-904 /GTAAC A A 
AATTAGGATCCT); IP28 with an EcoRI site, termination 
codon (in bold), six histidine codons (second underlined 
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sequence), and primer (2460-2440 /GGGAATTCTT A 
GTGATGGTGATGGTGATGT TTATTC ATTTCTTTTTG 
ACG). Matching at the cloning flanking site of 
XgtlO:IP13 with a BamHl site (CGGGATCCGCCT 
GGTTAAGTCCAAGC); IP14 with a BamHl site 
(CGGGATCCATGAGTATTTCTTCCAGGGTA). 



Parasite culture and cell lines 

The 3D7 strain of P. falciparum was cultured in human 
erythrocytes under a low oxygen atmosphere (5% 
5 % CO z and 90 Ny according to Trager & Jensen (1976). 
Synchronization was attained by Percoll gradient cen- 
trifugation method (Braun-Breton et al, 1986). Parasites 
were released from infected erythrocytes by saponin 
lysis (0.15% (w/v) saponin in PBS) as described 
(Beaumelle el al, 1987). Spodoptera frugiperda Sf9 cells 
were grown in TC100 medium (Gibco) supplemented 
with 10% (v/v) fetal calf serum, 100 units/ml penicillin 
and 100 ug/ml streptomycin. 



Libraries, clones and accession numbers 

The P/acsl sequence (Figure 1, scheme) was isolated 
from a XgtlO P. falciparum genomic library that we con- 
structed as described by Anderson & McDonald (1993). 
Genomic DNA from the P. falciparum was digested with 
EcoRI start activity, and the 2-4 kb generated fragments 
cloned into the EcoRI site of A.gl0 vector (Promega). The 
first screening was performed using a 715 bp long probe 
from clone 15 (probe-c, Figure 2) (digoxigenin (DiG)- 
labeled IP8-1165 fragment, generated by PCR) giving, as 
a result, clone 34. Clone 15 is a 1222 bp long insert, 
obtained from a Xgtll mung bean generated genomic 
library (McCutchan et al, 1984) of the IMMT strain (7G8 
clone), which was positive for malaria-falciparum 
immune sera from different geographical regions (data 
not shown). The clone 15 sequence has been deposited in 
the GenBank database with the accession number 
U10121. The XgtlO clone 34 was PCR-amplifted with pri- 
mers IP13-IP14 and the resulting purified insert was 
cloned into the pGem-T-vector (Promega) for sequen- 
cing. A 151 bp long DIG-labeled probe (probe-a, 
Figure 2), obtained by PCR with primers IP15 and IP18 
from the 5'-end of clone 34, was used for a second 
screening of the same library giving the clone 27. The 
whole sequence of P/acsl has been deposited in the Gen- 
Bank database under GenBank accession AF007828. 



DNA sequence analysis 

DNA sequences were determined by PCR /fluorescent 
nucleotides of both strands using a DNA Sequencing Kit 
(Perkin Elmer) in an ABI373 DNA sequencer (Applied 
Biosystems, Foster City, GA) with vector primers as well 
as the internal ones. The DNA sequence analysis was 
carried out using Strider 1.1 software (Marck, 1988) and 
the sequence analysis package Genetics Computer 
Group version 9.0 (University of Wisconsin, Madison, 
WI) (Devereux et al., 1984). Searches made by the BLAST 
program were performed in the non-redundant protein 
library at the National Center for Biotechnology Infor- 
mation. Comparison analysis was performed with the 
PILLEUP and BOXSHADE softwares. Prediction of 
transmembrane structure was carried out at GenomeNet 
WWW server/MOTIF, Institute for Chemical Research, 
Kyoto University, Japan. 



Southern blot and probes 

Genomic DNA was isolated by the SDS-proteinase K 
method and carried out as indicated by standard meth- 
odology (Ausubel et aL, 1990). After digestion with var- 
ious restriction enzymes according to the manufacturer's 
recommendations, DNA was fractionated on a 1 % agar- 
ose gel, blotted onto nitrocellulose filters (Schuell and 
Schuells) and UV cross-linked at 1.2 x 10 s uj/cm 2 . Filters 
were hybridized with [a-^PJATP-labeled probes 
obtained by random priming using a DNA labeling kit 
(Pharmacia Biotech) in 5 x SSC, 15 % (w/v) SDS, 1 % 
blocking reagent (Boehringer Mannheim) for 18 hours at 
60 °C. Blots were washed in 0.2 x SSC, 0.1 % (w/v) SDS 
at 60 °C before exposure to Curix RP2 Agfa him at 
-70 °C. The probes-a (primers IP15-IP18, 151 bp long) 
and b (primers 1389-IP8, 434 bp long) were obtained by 
PCR amplification from clone 34 DNA, purified, and 
then radiolabeled. 

RNA Isolation, Northern blot and probes 

Total parasite RNA was extracted using the method 
described by Chomczynski & Sacchi (1987) and the 
Northern performed following standard methodology 
(Ausubel et al, 1990). Briefly, the RNA (10 ug) was 
separated by 1.2% agarose /formaldehyde gel electro- 
phoresis and transferred onto Biotrans nylon membrane 
(ICN), UV cross-linked and hybridized for 18 hours at 
58 °C with random-priming Dig-labeled probe-c (715 bp 
long, primers 1165-IP8). Blots were washed with 
0.5 x SSC, 0.1% SDS at 60 °C before applying an immu- 
nodetection procedure with chemoluminiscent substrate 
CDP-star (Boehringer Mannheim) according to the 
manufacturer's recommendations. 

Production of mouse specific antisera against clone 
15 (anti-15) 

The 1222 bp long insert of the clone 15 coding for an 
amino acid sequence identical with the carboxy- terminal 
397 amino acid residues of the PfACSl (clone 34) was 
subcloned into the EcoRI site of the pATHll fusion 
expression vector (Yansura, 1990) in frame with the 
anthranilate synthetase (TrpE) bacterial protein. The 
pATHll construct was used to transform E. coli C600, 
and induced the expression according the procedures 
already described (Ausubel et al, 1990; Yansura, 1990). 
The bacterial extracts containing the fusion protein TrpE- 
15 (78 kDa) and TrpE (35 kDa) were purified by prepara- 
tive electro-elution from a SDS-PAGE system as 
described by Matesanz & Alcina (1996). Female eight- 
week old Swiss mice were injected with 10 ug of highly 
purified (>95 %, data not shown) fusion protein TrpE-15 
or TrpE as control, in three doses over a period of two 
months: the first dose emulsified in complete Freund's 
adjuvant, and the second in incomplete Freund's adju- 
vant, were injected intra peritoneally; the third does was 
injected intravenously in PBS. The serum was adsorbed 
of irrelevant cross-reactivity and the specificity con- 
firmed in Western blots of bacterial extracts expressing 
TrpE-15 fusion protein and TrpE (data not shown). 

Western blots 

Extracts of P. falciparum infected erythrocytes (iE) at 
5-10% parasitemia, and uninfected erythrocytes (E), 
pelleted by centrifugation and lysed with 0.15% saponin 
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in PBS to release the parasites; and recombinant and 
mock virus-infected Sf9 cells, and purified recombinant 
parasite protein (PfACSl x 6His) were subjected to 
SDS-PAGE under reducing conditions and Western blots 
following standard procedures (Ausubel et aL, 1990). 
Blots were incubated with 1/1000 dilution of the mouse 
anti-15 antibody for four hours at 22 °C, followed by 
horseradish peroxidase-conjugated secondary antibody 
incubation (1/15,000 dilution) (Pierce), and developed 
with the chemoluminiscent substrate ECL (Amershan). 

Construction of PfACSl x 6His expression vector 

The whole coding sequence of PFACS1 x 6His 
(2493 bp long) containing an added tag of 6 x His at the 
C- terminal was obtained by PCR reaction using Pfu Taq 
polymerase (Stratagene) with 10 ng of P. falciparum DNA 
and the primers IP19-IP28 in the following reaction con- 
ditions: a single five minute, 95 °C denaturation step, 35 
cycles at 95 °C for 075 minute; 47 °C for 1.5 minutes; 
72 °C, three minutes; and ten minutes at 72 °C as a final 
extension step. The PCR-amp lined insert was cloned into 
the EcoRI site of the pFastBacl plasmid of the Bac-to-Bac 
Baculovirus Expression System (Life Technologies, 
Gibco-BRL) and transfected into E. coli DHlOBac as 
indicated by the manufacturer. The sequence was then 
confirmed. The resulting recombinant bacmid was trans- 
fected by lipofection into Sf9 cells and the supernatant 
containing recombinant or mock virus collected for fol- 
lowing infections. The PfACSl x 6His expression by the 
Sf9 infected cells were monitored by Western blots with 
anti-15 antibody. 

Purification of PfACSl x 6His from recombinant 
Sf9 cells 

All purification procedures were carried out at 4°C. 
Twenty dishes (100 mm diameter) with 7 x 10 6 recombi- 
nant or mock infected Sf9 cells were harvested at 48 
hours after infection, and resuspended in 6 ml of lysis 
buffer A (50 mM potassium phosphate (pH 7.4), 
300 mM potassium chloride, 400 mM sodium chloride, 
40 mM imidazole, 20% (w/v) glycerol, 1% (w/v) Triton 
X-100, 4 mM 2-mercaptoethanol and 1 mM phenyl- 
methylsulfonyl fluoride (PMSF)), disrupted by sonication 
and centrifuged at 10,000 g for ten minutes. The resulting 
supernatant was added to 300 ul of buffer A-equilibrated 
Ni +2 -NTA resin (QIAGEN Inc.), and the slurry was 
stirred for four hours at 4 °C. The resin was washed four 
times with five volumes of buffer A, and three further 
washes with buffer A containing 2 mM ATP. Protein 
was eluted with 300 ul of buffer A containing 2 mM 
ATP, and 240 mM imidazole. Protein concentration of 
eluted active fractions was measured with a Bicinchonic 
acid/CuS0 4 protein determination kit (Sigma) and by 
densitometry of the band in Coomassie blue-stained gel 
compared with stained molecular mass standard (Phar- 
macia Biotec). It was between 8 and 12 ug/ml. Extracts 
of mock-infected Sf9 cells were subjected to the same 
procedure. 

Enzymatic activity of purified PfACSl x6His 

Acyl-CoA synthetase activity was determined 
by the isotopic assay described by Wilson et ai 
(1982) using [9,10(n)- 3 H] palmitic acid in ethanol 
(52 Ci/mmol) (Amersham Corp., Buckinghamshire, 
United Kingdom) as one of the co-substrate. Briefly, 
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100 ul of incubation mixture contained 2 ul of purified 
preparation of PfACSl x 6His (20 ng) in Tris-HCl buf- 
fer (pH 8) (100 mM Tris-HCl, 10 mM MgC12, 1 mM 
KF, 2mM EDTA, 2 mM dithiothreiol), 5 mM ATP, 
300 uM CoA and 0.2 uM [ 3 H]palmitic acid (1 uCi), 
was incubated for 20 minutes at 37 °C. The reaction 
was stopped by the addition of 125 ul of Dole's sol- 
ution (isopropyl alcohol, heptane, 1 M H2SO4, 40:10:1, 
by vol.) and 50 ul of water followed by 500 ul of hep- 
tane and vigorous vortexing. After centrifugation for 
one minute at 10,000 g the upper phase was dis- 
carded. The lower aqueous phase was then washed 
twice more: 500 ul of scintillation liquid was added to 
each tube and counted in a MicroBeta Plus counter 
(Wallac, EG&G Company, Turku, Finland). The K m 
and of the PfACSl shown inside the plot of 

Figure 6 was determined in a reaction mixture con- 
taining palmitic acid from 5 to 50 uM by applying the 
Lineweaver-Burke plot (not shown). The effect of pH 
was carried out in a reaction mixture containing Mops 
as the buffer (Knoll et aL, 1994). The purified fraction 
on Ni 2+ -NTA resin from mock-infected Sf9 cells gave 
no activity. 

Indirect immunofluorescence assay (IFA) 

Thin smears of parasitized erythrocytes from asyn- 
chronous or synchronous cultures of P. falciparum were 
air-dried and fixed in cold acetone for ten minutes and 
processed for IFA as described (Alcina et aL, 1986). Fixed 
preparations were overlaid with 25% (v/v) fetal calf 
serum (FCS) in PBS for 15 minutes, and incubated with 
the first antisera (anti-15 or anti-TrpE) at 1/500 dilution 
for two hours. A second and third reagent, (anti-mouse 
biotinylated antibody and fluorescein isothiocyanate 
(FITC)-conjugated streptavidin, respectively) were used 
at dilutions as recommended by the manufacturer (Boeh- 
ringer Mannheim). Photomicrographs were taken using 
a Carl Zeiss Axiophot microscope. 

Immunoelectron microscopy 

Localization of the PfACSl antigen in infected erythro- 
cytes by immunogold labeling was performed following 
a modified procedure as described (Hern£ndez-Munain 
et aL, 1991). Briefly, infected erythrocyte cultures at 
about 8% parasitemia were fixed in 3% (v/v) glutaral- 
dehyde/25 mM sodium cacodylate buffer (pH 6.9) for 14 
hours. After washing in cacodylate buffer, cells were 
pelleted into agarose. The pellets were dehydrated in 
ethanol and finally in propylene oxide. Agarose blocks 
containing the cells were embedded in Epon 812 resin 
and polymerized at 60 °C for two days. Ultramicrotome 
sections of 70 nm were picked up on 200-mesh carbon- 
coated gold grids. The grids were then placed in a moist 
chamber for the following incubations: one hour in 25 % 
FCS/PBS, followed by two hours incubation with 1/500 
dilution of the anti-15 antibody in FCS/PBS. Negative 
controls included anti-TrpE antibody or no first 
antibody. The second reactive was a gold-labeled protein 
G of particle size 10 nm (Sigma). The grids were washed 
in PBS and water and finally they were post-stained 
with 2 % (w/v) uranyl acetate. 
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A region of genomic DNA from Clostridium kluyveri was cloned in Escherichia coli by a screening strategy 
which was based on heterologous expression of the clostridial 4-hydroxybutyrate dehydrogenase gene. The gene 
region (6,575 bp) contained several open reading frames which encoded the coenzyme A (CoA)- and NADP+- 
dependent succinate-semialdehyde dehydrogenase (sucD), the 4-hydroxybutyrate dehydrogenase (4hbD)> and a 
succinyl-CoA:CoA transferase (cat J), as analyzed by heterologous expression in E. coli. An open reading frame 
encoding a putative membrane protein (orfY) and the 5' region of a gene encoding a o- 54 -homologous sigma 
factor (sigL) were identified as well. Transcription was investigated by Northern (RNA) blot analysis. Protein 
sequence comparisons of SucD and 4HbD revealed similarities to the adhE (aad) gene products from E. coli and 
Clostridium acetobutylicum and to enzymes of the novel class (III) of alcohol dehydrogenases. A comparison of 
CoA-dependent aldehyde dehydrogenases is presented. 



The gram-positive anaerobic bacterium Clostridium kluyveri 
(3) ferments ethanol and acetate to butyxate, caproate, and 
molecular hydrogen (7). ATP, required for growth, is gained 
by substrate-level phosphorylation from acetyl phosphate, and 
the quantity is proportional to the amount of hydrogen pro- 
duced (46, 50). Investigations of additional metabolic abilities 
revealed that this organism can utilize crotonate, vinylacetate, 
and 4-hydroxybutyrate as substrates (4, 5) and is able to fer- 
ment the unusual substrate combination of succinate plus eth- 
anol (27). A pathway was proposed, one in which succinate is 
first activated and then reduced by a two-step reaction to give 
4-hydroxybutyrate, which is then further metabolized to crot- 
onyl-coenzyme A (CoA) (Fig. 1) (27). In a previous study, we 
discussed enzymes involved in the anaerobic breakdown of 
succinate by C. kluyveri, specifically, a succinyl-CoA:CoA trans- 
ferase, a CoA- and NADP + -dependent succinate-semialde- 
hyde dehydrogenase, and a 4-hydroxybutyrate dehydrogenase 
(49). Wolff et al. (55) independently confirmed these data by 
* 3 C-nuclear magnetic resonance studies as well as enzymatic 
investigations on the dehydrogenases. 4-Hydroxybutyryl-CoA 
dehydratase, which catalyzes the last step of the succinate- 
specific pathway, the dehydration and isomerization of 4-hy- 
droxybutyryl-CoA to crotonyl-CoA, was recently identified, 
purified, and characterized (45). We present here some mo- 
lecular aspects of this pathway, including the cloning, sequenc- 
ing, and heterologous expression of a C. kluyveri DNA region 
which encodes a succinyl-CoA:CoA transferase, the succinate- 
semialdehyde dehydrogenase, and the 4-hydroxybutyrate de- 
hydrogenase. 

MATERIALS AND METHODS 

Bacterial strains, plasmids, media, and growth conditions. C. kluyveri (DSM 
555) was obtained from the Deutsche Sammlung von MikroorganLsmen und 
Zcllkulturcn GmbH, Braunschweig. Germany. Escherichia coli JM109 (58) and 
the pBluescripl SK vector (Slralagene, San Diego, Calif.) were from the labo- 
ratory collection. C. kluyveri cells used for DNA isolation were grown at 37°C 
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under strictly anaerobic conditions on ethanol (300 mM) and succinate (100 
mM) as previously described (49). Cells for RNA preparation were cultured in 
the same medium except that succinate was replaced by acetate (100 mM). £. coli 
cultures were routinely grown at 30°C in Luria-Bertani (LB) medium (42) on a 
rotary shaker. Tetrazolium indicator plates (6) containing 4 g of 4-hydroxybu- 
tyrate per liter were employed for the screening procedure (oxidation of 4-hy- 
droxybutyrate). Utilization of 4-hydroxybutyrate (4 g/liter) as a carbon source for 
recombinant E. coli clones was investigated in M9 medium (42), supplemented 
with a small amount of yeast extract (0.2 g/liter), MgS0 4 (2 mM), and CaCl 2 (0.1 
mM). Ampicillin (75 mg/liter) was added to the media for £. coli as a selection 
marker, when needed. 

Nucleic acids isolation and recombinant DNA techniques. Chromosomal 
DNA from C. kluyveri was isolated by the method of Sailo and Miura (41). Total 
RNA from C. kluyveri was isolated by the hot phenol-chloroform procedure as 
described by Gerischer and Durre (19). DNA was manipulated by standard 
methods (42); restriction enzymes and T4 DNA ligasc were purchased from 
GIBCO/BRL (Eggenslein, Germany). For plasmid isolation from E. coli, the 
Quiagen Midi Kit (Diagen GmbH, DOsseldorf, Germany) was used. For cloning 
purposes, genomic C. kluyveri DNA was partially f/indlll digested and fraction- 
ated on a sucrose gradient (10 to 40% [wt/vol]). Fractions of approximately 3 to 
5 and 4 to 7 kb were ligaled into Hindi II -digested pBluescripl SK vector, and the 
product was used to transform E. coli JM109. Recombinant clones were screened 
for their ability to oxidize 4-hydroxybutyrate (see above). Nested deletion sub- 
clones were prepared from pCKl and pCK3 (the clostridial inserts were in a 
different orientation within the vector) by using the Kpnl and CM sites of the 
vector to generate the exonuclease IH-resistant and -sensitive ends, respectively. 
Exonuclease III digestion and all further steps were performed with the Erase- 
a-Basc system (Promcga, Madison, Wis.) according to the manufacturer's in- 
structions. 

DNA sequencing and sequence analysis. Double-stranded DNA was se- 
quenced by the dideoxy chain termination method (43) with a- 35 S-dATP (Du- 
Ponl, NEN Research Products) and the Sequenase T7 DNA polymerase kit from 
U.S. Biochemical (Bad Homburg, Germany) according to the corresponding 
protocol. The entire sequence (6,575 nucleotides) of the C. kluyveri insert of 
pCKl was determined lor both strands with the nested deletion subclones gen- 
erated from pCKl and pCK3 and the commercially available M13/pUC universal 
and reversal sequencing primers. In addition, some synthetic oligonucleotides 
(17-mcrs) complementary to the already sequenced templates were employed. 
These primers were prepared with a Gene Assembler Plus (Pharmacia Biotech 
Europe, Freiburg, Germany) according to the manufacturer's instructions. 

Computer sequence analysis. The DNA sequence data and the deduced amino 
acid sequences were analyzed with the Genetics Computer Group Inc. sequence 
analysis software package, version 6.2 (13), on a VAX 9000 computer. Database 
searches were performed with the National Biomedical Research Foundation 
Protein Information Resource Network Server according to the algorithm of 
Pearson and Lipman (38). 

Hybridization. Total chromosomal DNA from C. kluyveri or E. coli plasmid 
DNA was digested to completion with the appropriate restriction enzyme and 
separated on agarose gels. Southern blots on nylon membranes (GeneScreen 
Plus; DuPont, NEN Research Products) were prepared according to the manu- 
facturer's protocol. DNA fragments used as probes were isolated from agarose 
gets with the Gene Clean Kit (Bio 101, La Jolla. Calif.). The probes were labeled 
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FIG- 1. Schematic pathway for the ethanol -succinate fermentation of C. 
kluyveri. 1, alcohol dehydrogenase; 2, acetaldehyde dehydrogenase; 3, thiolase; 4, 
p-hydroxybutyryl-CoA dehydrogenase; 5, crotonase; 6, butyryl-CoA dehydroge- 
nase; 7, CoA-transfer&se (probably several en7ymes with different substrate 
specificities); 8, succinatc-scmialdchydc dehydrogenase; 9, 4-hydroxybutyratc de- 
hydrogenase; 10, 4-hydroxybulyryl-CoA dehydralase-vinylaeelyl-CoA A 3 -A 2 - 
isom erase, as described elsewhere (21, 45, 49, 55). The formation of acetate, 
hydrogen, and caproate is not shown. 



either with [a- 32 P]dATP (DuPont, NEN Research Products) by using the Ran- 
dom Primed DNA Labeling Kit (U.S. Biochemical) or with digoxigenin by using 
the DIG DNA Labeling Kit (Boehringer GmbH, Mannheim, Germany). Mem- 
branes were prehybridized in 0.15% (wt/vol) polyvinylpyrrolidone-0.15% (wt/ 
vol) bovine albumin-0.15% (wt/vol) Ficoll 400-0.9 M NaCl-10% (wt/vol) dext- 
ran sulfatc-1% (wt/vol) sodium dodccyl sulfate (SDS)-0 mM EDTA-90 mM 
Tris-HCl (pH 7.5)-100 jig or herring sperm DNA per ml (80 ml/cm 2 ) for 1 to 3 
h at 55°C. The appropriate probe was added to the prehybridization solution and 
incubated for 10 to 15 h at 55*C. Membranes were then washed twice in 2x SSC 
(lx SSC is 0.15 M NaCI plus 0.015 M sodium citrate) for 15 min at room 
temperature and once in 2x SSC plus 1% (wl/vol) SDS at hybridization tem- 
perature. If necessary, additional washing was performed in SSC solutions, con- 
taining 1% (wt/vol) SDS, of decreasing ionic strength (lx SSC, 0.1 x SSC) at 
hybridization temperature. Membranes were then subjected to autoradiography 
or, in the case of the digoxigenin-labeled probes, manipulated with the appro- 
priate detection kit (Boehringer Mannheim GmbH). RNA for Northern blot 
analysis was separated in denaturing formaldehyde gels and transferred to nylon 
membranes (GeneScreen Plus; DuPont, NEN Research Products) as described 
in the manufacturer's manual. An RNA ladder (0.24, 1.4, 2.4, 4.4, 7.5. and 9.5 kb; 
G1BCO/BKL) was included as standard for size determination. Hybridization 
and washing were performed according to the protocol described above for the 
radiolabeled DNA hybridization procedure. 

Determination of enzyme activity. Cells from recombinant E. coli clones 
(pCKl, pCK2, pCK3, pCK4, and pSK) were grown at 30*C in LB medium on a 
rotary shaker and harvested by centrifugation. If necessary, isopropyl-B-D-thio- 
galactopyranoside (IPTG; 1 mM) was added at an optical density of 0.5 (580 nra) 
and cells were grown for an additional 3 h and harvested as described above. In 
order to prevent enzyme inaclivalion by oxygen, crude extracts were prepared 
anaerobically in 50 mM potassium phosphate buffer (pH 7.5) containing 3 mM 
dithioerythrito), with a French pressure cell (80 MPa; Am icon, Silver Springs, 
Fla.). Protein concentration was determined according to the method of Brad- 
ford (8). 4-Hydroxybulyrale dehydrogenase was assayed in anaerobic glass cu- 
vettes at 30°C in 90 mM 2-amino-2-methyl-l,3-propanediol-HCl (pH 8.5), con- 
taining 3 mM dithioerythritol, 1 mM MgS0 4 , and 1 mM NAD + in a final volume 
of 1 ml. The reaction was started by the addition of 4-hydroxybutyratc (10 mM) 
and monitored at 340 nm (e = 6.3 mM" 1 cm" 1 ). Activity of sucrinale-semiaJ- 
dehyde dehydrogenase was assayed in anaerobic glass cuvettes in 50 mM TAPS 
(A/-tris[hydroxymethyl]methyl-3-aminopropanesuIfonic acid)-HQ buffer (pH 
8.5) containing 1 mM dithioerythritol— 1 mM NADP + at 30*C in a final volume 
of 1 ml. The reduction of NADP* was monitored at 340 nm. To determine the 
CoA-independcnt enzyme activity, the reaction was initiated by the addition of 
succinate-semialdehydc (10 mM). In the case of the CoA-dependent enzyme 
activity, the assay was initiated with succinate-semialdehyde (10 mM) and then 
started with CoA (0.1 mM). CoA-dependent succinate-semialdehyde dehydro- 
genase activity was defined as the difference between the CoA-dependent and 
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FIG. 1 (A) Schematic map of the pCKl to pCK4 inserts and the ORFs 
deduced from the pCKl nucleotide sequence. Vertical bars indicate fftndlll 
restriction sites. (B) Schematic drawing of some of the subclones generated from 
pCKl and pCK3 by Uie nested deletion method. The E. coli host phenolype 
(oxidation of 4-hydroxybutyrate) is indicated by continuous lines (positive) and 
dashed lines (negative). 



CoA-independent enzyme activity. Succinyl-CoA:CoA transferase was deter- 
mined with succinyl-CoA (0.1 mM) and acetate (0.2 M) as substrates as de- 
scribed by Schcrf and Bucket (44). The reaction was initiated with succinyl-CoA, 
and the acetyl-CoA product was condensed with oxaloacetate-libe rating Co ASH. 
Formation of the latter was determined with 5,5'-dithio-bis(2-nitrobenzoic acid) 
(NbSj) at 412 nm (e = 13.6 mM" 1 cm" 1 ). 

Determination of the N-terminal amino acid sequence. The N-terminal amino 
acid sequence of the purified succinatc-scmialdchydc dehydrogenase was deter- 
mined directly from the eleclroblolted polyvinylidene dilluoride membrane by 
using a Protein Peptide Sequencer 477A (Applied Biosystems, Foster City, 
Calif.). Detection was performed on-line with a phenylthiohydantoin analyzer. 

Nucleotide sequence accession number. The sequence data reported here 
were submitted to the EMBL database and assigned accession no. L21902. 



RESULTS 

Cloning of the C kluyveri gene region. Of 10 4 transformants 
screened, four E. coli clones formed red colonies; they were 
able to reduce the tetrazolium salt added at the expense of 
4-hydroxybutyrate oxidation. pCKl, which was chosen for fur- 
ther analysis, harbored, as shown by sequence analysis, a 
6,575-bp insert, which contained seven Hindlll restriction en- 
donuclease recognition sites (Fig. 2). pCK2 and pCK4 covered 
only a part of pCKl (four //mdlll sites; 4,458 bp) whereas the 
insert of pCK3 (about 7,500 bp) exceeded the pCKl region at 
the left-hand end (Fig. 2). The inserts of pCKl and pCK2 were 
in the orientation of the pBluescript lac promoter, while those 
from pCK3 and pCK4 were in opposite orientation. All clones 
containing the recombinant plasmids pCKl, pCK2, pCK3, and 
pCK4 were able to grow on 4-hydroxybutyrate as the sole 
carbon and energy source, which is not a substrate for the E. 
coli wild type. 4-Hydroxybutyrate dehydrogenase activity was 
detected in all four clones (50 to 160 mU/mg). 
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To confirm an identical arrangement of the Hindlll frag- 
ments within the genomic DNA of C. kluyveri, Southern blot 
analysis was performed. With a 1,087-bp EcoKl and an 864-bp 
Hindlll fragment (bp 900 to 1987 and 2117 to 2981, respec- 
tively) as probes for chromosomal C. kluyveri DNA, digested 
with Pstl, Hindlll, or EcoRI, hybridization signals yielded the 
expected size of fragments (data not shown). From these data 
as well as from the identical nucleotide sequence present in 
pCK3, it was concluded that the cloned Hindlll fragments in 
pCKl represent a contiguous C. kluyveri genomic DNA frag- 
ment and not a multiple ligation of Hindlll fragments. 

Nucleotide sequence analysis. For sequencing purposes and 
for localization of the 4-hydroxybutyrate dehydrogenase-en- 
coding gene, nested deletion subclones were generated from 
pCKl and pCK3 as described in Materials and Methods. The 
complete nucleotide sequence encompassing the insert of 
pCKl (6,575 bp) was submitted to the EMBL data base (see 
Materials and Methods). DNA sequence analysis revealed four 
complete open reading frames (ORFs) (referred to as orfY, 
catl, sucD, and 4hbD) and two truncated ORFs at the 5' and 
3' ends, respectively (orfZ and sigL), in the same transcrip- 
tional orientation (Fig. 2 and 3). An additional reading frame, 
encoding 36 amino acids, was located in the intergenic region 
between orfZ and orfY (bp 856 to 966), but it is likely that it has 
no coding function, since its G+C content (20.7%) was very 
low compared with that of the other ORFs (31.1 to 36.2%) and 
with that reported for genomic DNA (29.8%) from C. kluyveri 
(1). In addition, the nucleotide sequence downstream of the 
corresponding AUG start codon contained an inverted repeat 
with a calculated free energy of -66.5 kJ/mol (17), which could 
inhibit ribosome binding. Two other inverted repeats were 
found downstream of the 4hbD gene. A free energy of -62.8 
kJ/mol was calculated for the first repeat, which is 33 bp down- 
stream of the 4hbD stop codon (bp 6252 to 6285). The corre- 
sponding mRNA stem-loop would be followed by several LPs 
as is typical for a rho-independent terminator (39). The second 
repeat had a calculated free energy of -44 kJ/mol and was 
located 133 bp further downstream. The AUG start codons of 
cat], sucD, and 4hbD were preceded by putative ribosome 
binding sites at a distance of 7, 6, and 7 bases, respectively, with 
reasonable homology to those described for E. coli (47) and for 
Clostridia (60). In the case of the 3'-truncated ORF (sigL), the 
Shine-Dalgarno sequence was at a distance of 17 bases from 
the AUG codon. However, there was another possible start 
codon (GUG) in frame which would reduce the distance to the 
ribosome binding site (9 bases). For orfY, two possible sites for 
translation initiation were found. Upstream of the usual start 
codon, AUG (bp 1,050), with the putative ribosome binding 
site at a distance of 11 bp (AGGAG), there was a second 
possible start codon (UUG, bp 981), which also had a putative 
Shine-Dalgarno sequence at a distance of 8 bp (GGAGG). 

Identification and expression of 4hbD, sucD, and catl. By 
analyzing the phenotype of the nested deletion subclones, the 
DNA region encoding the 4-hydroxybutyrate dehydrogenase 
gene was identified. Subclones having a deletion within the last 
complete ORF (4hbD) were no longer able to oxidize 4-hy- 
droxybutyrate. That this gene (371 amino acids [aa], 41,755 
Da) encodes the 4-hydroxybutyrate dehydrogenase was further 
confirmed by homologies to several class III alcohol dehydro- 
genases (see below). 

The adjacent ORF upstream had an N-terminal amino acid 
sequence which was identical to that determined for the puri- 
fied CoA-dependent succinate-semialdehyde dehydrogenase 
(M-[S]-N-B-V-S-I-K-E-L-I-E-K-A-K-V-A-Q-K-K-1^B.[A]-Y), ex- 
cept for one mismatch in position 16 where a valine was found 
in the protein but an alanine was deduced from the nucleotide 



sequence. Therefore, that ORF was designated sucD. For the 
sucD gene product (472 aa), a molecular mass of 50,915 Da 
was calculated, which was somewhat smaller than that deter- 
mined for the purified enzyme by SDS-polyacrylamide gel elec- 
trophoresis (PAGE) (55 kDa [49]). CoA-dependent succinate- 
semialdehyde dehydrogenase activities in recombinant £. coli 
clones harboring the sucD gene (pCKl to 4) were very low (10 
to 20 mU/mg). 

catl, which was located directly upstream of sucD, encoded 
a 538-aa protein with a calculated molecular mass of 58,852 
Da. A database search showed significant homologies (41.8 
and 40.2% identity and 62.4 and 61.4% similarity) to an acetyl- 
CoA hydrolase (Achl) from Saccharomyces cerevisiae (31) and 
to a gene product from Neurospora crassa (Acu-8) which is 
essential for growth on acetate (34), respectively. For bioen- 
ergetic reasons, we assume that in C. kluyveri this ORF en- 
codes a Co A transferase rather than an acetyl-CoA-hydrolyz- 
ing enzyme. However, as with the CoA-dependent succinate- 
semialdehyde dehydrogenase, the specific activities of a 
succinyl-CoA:CoA transferase in recombinant E. coli clones 
were very low (5 to 16 mU/mg). 

To induce transcription of catl, sucD, and 4hbD starting 
from the pBluescript lac promoter, recombinant clones har- 
boring either a part (pCK169 ['orfZ or/Y catl sucD'; bp 1 to 
3772], pCK2 ['catl sucD 4hbD sigL'; bp 2117 to 6575]) or the 
complete gene region (pCKl, bp 1 to 6575) were grown in the 
presence of IPTG (1 mM), and cell extracts were prepared and 
analyzed for enzyme activity as described in Materials and 
Methods (Table 1). Cell extracts from E. coli harboring either 
pCKl or pCK169 revealed significant succinyl-CoA:CoA trans- 
ferase activity. Since the release of CoASH (monitored by the 
reduction of NbS 2 ) was dependent on all assay components 
(succinyl-CoA, oxaloacetate, citrate synthase, and acetate), a 
simple CoA-hydrolyzing activity like Achl from S. cerevisiae 
could be excluded. CoA-dependent succinate-semialdehyde 
dehydrogenase and 4-hydroxybutyrate dehydrogenase activi- 
ties were present in E. coli (pCKl) and E. coli (pCK2) (Table 
1). In the case of succinate-semialdehyde dehydrogenase, the 
assay was performed both in the presence and in the absence 
of CoA in order to differentiate between the CoA-independent 
E. coli enzyme, which occurred in all clones, and the recombi- 
nant CoA-dependent enzyme from C. kluyveri (Table 1). SDS- 
PAGE analysis during induction of the recombinant E. coli 
clones revealed an increase of protein bands corresponding to 
molecular masses of 66, 55, and 37 kDa (not shown). It is 
apparent that the 66-, 55-, and 37-kDa protein bands corre- 
spond to the catl, sucD, and 4hbD gene products. The 66-kDa 
protein occurred only in recombinant clones harboring the catl 
gene (pCKl and pCK169), whereas the 55- and 37-kDa pro- 
teins were present only in clones harboring the sucD and 4hbD 
genes. 

SucD and 4HbD sequence comparisons. A database search 
with the amino acid sequences of SucD (472 aa) and 4HbD 
(371 aa) revealed homologies to the adhE gene product from 
E. coli (891 aa) and to homologous proteins from two Clos- 
tridium acetobutylicum strains (DSM792, AdhE, 862 aa; ATCC 
824, Aad, 873 aa) and an anaerobic protozoan {Entamoeba 
histolytica, Adh2, 870 aa), respectively (16, 20, 35, 57). The 
protein sequences from the C. acetobutylicum strains are al- 
most identical and differ only with respect to the last 11 aa. 
Whereas the C. kluyveri SucD protein showed high similarity to 
the AdhE (Aad, Adh2) N-terminal region, the 4HbD protein 
corresponded to the C-tcrminal region (Table 2). The E. coli 
adhE gene product has both CoA-dependent aldehyde dehy- 
drogenase activity and alcohol dehydrogenase activity (20). 
These functions are also proposed for the adhE and aad gene 
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FIG. 3. Nucleotide sequence of the 6,575-bp insert of pCKl representing the region of chromosomal DNA from C. kluyvcri that contains orfZ (3'-terminal 
fragment), orfY, cat 1, sue D, 4hbD, and sigL (5'-terminal fragment). The genes have been translated with the one-letter amino acid code with the symbols below the first 
nucleotide of the corresponding codon. Putative ribosome binding sites are underlined, and inverted repeats arc indicated by open arrows representing the length and 
orientation of the stems. Bars above the sequence indicate the -35 and -10 regions of a putative promoter motif. 
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TABLE 1. Enzyme activities of recombinant £. coll clones 
harboring the C kluyveri call, sucD, and 4hbD genes after induction 
from the pBluescript lac promoter 41 



Sp act (mU/mg) 



Succinate- 

Strain and plasmid SucdnyW3oA . semialdehyde 4_„ ydrwybutyrate 

CoA-transfcrasc dehydrogenase dchydrogcnasc 



-Co A 6 +CoA* 



E. coli JM109 


<1 


19 




1 <1 


<2 


pBluescript 












& coli JM109 pCKl 


184 ± 10 


51 




3 80 ± 17 


273 ± 66 


(cat] sucD 4hbD) 












E. coli JM109 


220 ± 13 


23 




2 <1 


22 ±5 


pCK169 (catl) 












E. coli JM109 pCK2 


<1 


49 


+ 


1 101 ± 15 


475 ± 87 


(sucD 4hbD) 













0 Cells were grown in LB medium containing ampicillin (75 mg/liler), and 
IPTG (1 raM) was added at an optical density of 0.5. Crude extracts were 
prepared and analyzed for enzyme activity as described in Materials and Meth- 
ods. Data represent average values and standard deviations from two indepen- 
dent experiments. 

* -Co A and +CoA refer to the CoA-dependent and -independent succinate- 
semialdehyde dehydrogenase activities. 



products from C acetobutylicum DSM792 and ATCC 824, 
respectively (16, 35). Significant similarities were also found 
between the C kluyveri 4HbD protein and other alcohol de- 
hydrogenases (Table 2). These enzymes as well as the E. coli 
and C. acetobutylicum adhE (aad) gene products represent a 
new class (III) of alcohol dehydrogenases, which differ from 
both the long-chain zinc-containing (type I) and the short- 
chain zinc-lacking (type II) enzymes (2, 26, 40). 

Analysis of orfY y orfZ, and sigL. As mentioned above, two 
possible sites for translation initiation of orJY were found. 
Hydropathy analysis of the deduced protein sequence revealed 
several highly hydrophobic regions. A translation start with 
AUG (as indicated by the arrow) would result in a protein of 
288 amino acids (30,873 Da) with nine hydrophobic and prob- 
ably membrane-spanning regions (Fig. 4). A translation start 
with UUG (311 aa, 33,507 Da) would produce a signal se- 
quence with a basic N terminus (two lysine residues) followed 



by 12 hydrophobic amino acids. A $(lacZ 'orfY)hyb30 fusion 
containing the first 30 aa of the lacZ a-peptide and 279 aa of 
orfY (starting with Pro-33 at bp 1073) was constructed from 
pCKl with the BamW restriction endonuclease site (pCK7). 
Recombinant E. coli clones were able to grow in LB medium, 
but if expression of the fusion protein was induced by the 
addition of IPTG (1 mM), cell lysis occurred, probably because 
of a disintegration of the cell membrane (not shown). 

Database searches revealed similarities (about 50%) to 
other putative membrane-spanning proteins from Desulfurolo- 
bus ambivalens (29), Haemophilus influenzae (52), and E, coli 
(DDBJ:D 13267). However, the function of these proteins is yet 
to be determined, and most of the similarities between them 
might be due to their common membrane-spanning function. 

Whereas the amino acid sequence deduced from the 5'- 
truncated ORF orfZ showed no significant homology to pro- 
tein sequences in the databases, the N-terminal amino acid 
sequence deduced from the last reading frame (sigL) f which 
was located at the 3' end of the pCKl insert, revealed signif- 
icant similarities to the family of rr^-related proteins (not 
shown). Since the N terminus of these proteins is highly con- 
served, it seems likely that sigL, which was designated in anal- 
ogy to the corresponding gene from another gram-positive 
organism (Bacillus subtilis), encodes a o^-homologous sigma 
factor. 

Northern blot analysis. To analyze the expression of the 
cloned genes in C. kluyveri, RNA was isolated from cells grown 
on ethanol plus succinate and ethanol plus acetate and pre- 
pared for Northern blot hybridization. With radiolabeled 
probes complementary to the 4hbD (bp 5668 to 6025) and the 
catl (bp 2117 to 2981) genes, significant hybridization predom- 
inantly occurred with RNA isolated from cells grown on eth- 
anol plus succinate (Fig. 5). A strong signal corresponding to 
approximately 2,700 nucleotides in length was obtained with 
the 4/*M)-complementary probe. In addition, two weak signals 
of about 5,500 and 9,500 nucleotides and a small transcript of 
about 1,800 nucleotides could be detected after prolonged 
exposure (Fig. 5). The caf/-complementary probe gave similar 
results: only low hybridization occurred with RNA from etha- 
nol-acetate-grown cells, whereas with RNA from cells grown 
on ethanol plus succinate, signals corresponding to a length of 



TABLE 2. Comparison of the C. kluyveri gene products SucD (succinate-semi aldehyde dehydrogenase) and 4HbD (4-hydroxybutyrate 

dehydrogenase) with aldehyde and alcohol dehydrogenases from other organisms 



Organism and gene product (reference(sl) 


Sequence positions* 


C. kluyveri SucD 6 
Identity (%) Similarity (%) 


C kluyveri 4Hbry 
Identity (%) Similarity (%) 


C. acetobutylicum AdhE-Aad (16, 35) 


2^58 


41.8 


58.8 








449-860 






26.1 


51.9 


E. coli AdhE (20) 


3^74 


37.2 


57.7 








451-862 






26.2 


51.4 


Entamoeba histolytica AdhE2 (57) 


7-491 


38.0 


60.8 








462-862 






26.1 


51.9 


C. acetobutylicum Adhl (61) 


1-381 






28.2 


55.2 


Saccharomyces ccrcvisiae Adh4 (54) 


4-380 






26.4 


51.4 


Zymomonas mobilis ATCC 10988 AdhB (59) 


1-383 






27.0 


50.1 


Zymomonas mobilis ZM4 AdhB (11) 


1-383 






26.6 


49.7 


Citrobacter freundii (12) 


25-393 






26.2 


50.0 


Bacillus methanolicus Mdh (14) 


3-381 






25.9 


49.3 


£ coli FucO (10) 


5-383 






23.6 


51.9 


C. acetobutylicum BdhA (51) 


6-389 






22.9 


47.9 


C. acetobutylicum BdhB (51) 


18-388 






21.2 


46.9 



• Numbers refer to the first and last amino acids of the resulting sequence comparison. 

* Sequence comparison was performed with the Bestfit algorithm (Genetics Computer Group package [13]) with a gap weight of 3.0 and a length weight of 0.1. 
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FIG. 4. Hydropathy plot of the orfY gene product according to the work of 
Kyte and Doolittle (30). The amino acid sequence was deduced from a transla- 
tion start with UUG (bp 981 [Fig. 3]), and the arrow indicates the first amino acid 
from a translation start with AUG (bp 1050 [Fig. 3)). Positive values represent 
high hydrophobicity and negative values indicate low hydrophobicity, averaged 
over a window of 7 aa (30). 



about 2,700, 5,500, and 9,500 nucleotides and some small tran- 
scripts were obtained (Fig. 5). 

DISCUSSION 

We screened recombinant E, coli clones for an oxidative 
utilization of 4-hydroxybutyrate and could isolate a DNA re- 
gion from C. kluyveri encoding the 4-hydroxybutyrate dehydro- 
genase, the CoA-dependent succinate-semialdehyde dehydro- 
genase, and a succinyl-CoA:CoA transferase. Since E. coli has 
two CoA-independent succinate semialdehyde dehydroge- 
nases which directly catalyze the oxidation of succinate semi- 
aldehyde to succinate (15), the use of 4-hydroxybutyrate as a 
carbon and energy source by E. coli depends only on the 



A B 



12 12 12 




FIG. 5. Northern blot hybridization with radiolabeled fragments which are 
complementary to the C. kluyveri 4hbD (A) and call (B) genes. Lanes 1, 8 u.g of 
total RNA from C. kluyveri cells, grown on ethanol plus succinate; lanes 2, 8 u,g 
of total RNA from C. kluyveri cells, grown on ethanol plus acetate. The cells were 
harvested in the logarithmic growth state. The sizes of selected marker bands 
(RNA ladder) are indicated in kilobascs at the left side of the figure. (A) Results 
obtained after hybridization with a 4/iM)-complementary probe and autoradiog- 
raphy for 9 and 21.5 h. (B) The blot was hybridized against a cat I -complementary 
probe and subjected to autoradiography for 16 h, as indicated. 
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presence and expression of the clostridial 4-hydroxybutyrate 
dehydrogenase-encoding gene (4hbD) as shown by the nested 
deletion subclone analysis. 

Thus far, the only known protein sequences of CoA-acylat- 
ing bacterial aldehyde dehydrogenases were those from the 
multifunctional AdhE enzymes from E. coli and the homolo- 
gous gene from C. acetobutylicum, which have both CoA-de- 
pendent aldehyde dehydrogenase and an alcohol dehydroge- 
nase activity (16, 20, 35). From the tandem arrangement of the 
C. kluyveri sucD and 4hbD genes and the similarities to the N- 
and C-terminal regions of the £. coli and C. acctobulylicum 
adhE (aad) gene products, it is evident that the AdhE proteins 
are composed of two catalytic domains: the N terminus exhibits 
a CoA-acylating aldehyde dehydrogenase activity whereas the 
C-terminal part is an alcohol dehydrogenase. This has also 
been suggested by Goodlove et al. (20), Fischer et al. (16), and 
Nair et al. (35). However, it should be noted that the E coli 
AdhE protein has a third function, the deactivation of the 
pyruvate:formate-lyase (28). Primary protein sequence com- 
parison of the C kluyveri SucD-4HbD proteins and the adhE 
(aad) gene products of C. acetobutylicum, E. coli, and the 
protozoan E. histolytica reveals some interesting features (Fig. 
6). The N-terminal region which corresponds to the C. kluyveri 
SucD protein (positions 1 to 472) allows now a first comparison 
of CoA-acylating aldehyde dehydrogenases (Fig. 6 A). The se- 
quence is highly conserved: from a consensus length of 490, 
138 aa are strictly conserved in the bacterial enzymes (99 aa 
including the protozoan AdhE protein sequence) with 321 aa 
being present in two of the three microbial species. The SucD 
homologous sequence includes one highly conserved motif for 
nucleotide binding (G-x-G-x-x-G; site 4) (53) and another re- 
gion with G-rich motifs (G-G-x-G; G-x-G-x-G; site 2), one of 
which also might contribute to ADP binding. Since CoA, like 
NAD(P)*, contains the ADP moiety, two nucleotide binding 
sites are reasonable. Another highly conserved region (site 3) 
contains a glycine and a cysteine residue and shows limited 
similarity to the active center proposed for CoA-independent 
aldehyde dehydrogenases (2, 23). It is suggested for these en- 
zymes as well as for the CoA-acylating aldehyde dehydroge- 
nases that an enzyme-bound thiol group (cysteine) is required 
at the catalytic site for the formation of a hemithioacetal in- 
termediate (25, 48). Database searches performed with the 
AdhE (Aad) proteins from C. acetobutylicum revealed signifi- 
cant but low homology to aldehyde dehydrogenases (CoA in- 
dependent) from different sources (<25% identity). In addi- 
tion to the active center described above, a conserved 
dodecapeptide of CoA-independent aldehyde dehydrogenases 
(G-V-TC-TGV-GQ-I-LIS-P-W-N-FY-P (24]) would corre- 
spond to positions 111 to 122 of the CoA-dependent enzymes 
(site 1 [Fig. 6A]). The proposed nucleotide binding site of 
CoA-independent aldehyde dehydrogenases would match po- 
sitions 174 to 179 and is thus not conserved. In addition, a 
typical decapeptide (V-TC-L-E-I^G-G-K-AS-P) of these en- 
zymes is missing. Since there is no similarity of CoA-dependent 
and CoA-independent aldehyde dehydrogenases in the C-ter- 
minal region, the highly conserved nucleotide binding site of 
CoA-dependent enzymes (site 4) might be involved in CoA 
binding. 

The C. kluyveri 4HbD protein as well as the AdhE proteins 
belongs to the class III alcohol dehydrogenases (2, 26, 40). 
Sequence comparison revealed a more or less common motif, 
which comprises several histidine residues (Fig. 6B, sites 1 and 
2) and might be involved in divalent cation (iron) binding (2, 
16, 61). The C kluyveri 4HbD protein contains three of the 
four histidine residues proposed to be involved in iron binding. 
However, a glutamate and an aspartate residue (site 1) are 
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CkSucD MSNEVS I KEL I E KAKAAQKKLE AYSQ EQVD VL VKALGKVVYDNAEM F AKEAVE ETBMGVYEDKVAKCHLKS GAI WNH I KDKKTVGI I KB E PE 92 

CaAdhE NKVTTVKELDEKLKVI KBAQKKF SCY SQEMVDE I FRNAAMAAIDAR I E LAKAAVLBTGMGLVEDKV I KNH PAGE Y I YNKYKDEKTCG 1 1 BRNB P 94 

CaAad MKVTTVKELDEKLKV I KBAQKKF SCY SQEMVDE I FRNAAMAAI DAR I ELAKAAVLBTGMGLVEDKVI KNH PAGE Y I YNKYKDE KTCGI I ERNE P 94 

BcAdhE MAVTNVAELNALVERVKKAQRBY AS FTQEQVDK I FRAAALAAADAR I PLAKMAVAESGMGI VEDKVI KNH FAS BY I YNAYKDEKTCGVLSBDDT 94 

BhAdhB MSTQQTMTVDEHINQLVRKAQVALKEYLKPEYTQEKIDYIVKKASVAAIJ^HCAIjAAAAVBBTGRGIFE 100 

••»*•••*., 100 

ffGV— P—Pfl [fG 
CkSucD RAL VYV AX PKGWAATT P I TN P WT PM CN AMAA I KG RNT 1 1 V A P H P KAKKVS AHTVE LMN AE L KKLG AP E N 1 1 Q I V EAP S R EAAKE LME S AD V - -VIATO 190 
CaAdhE YGITKIAEPIGWAAIIPVTNPTSTTIFKSLISLKTRNGIFPSPHPRAKKSTIIJU^TILDA^ -ATQ 192 

CaAad YG I TKI AE P I GW AA 1 1 P VTN PTSTT I FKSL I SLKTRNG I F FS PH PRAKKST I LAAKT I LDAAVKS GAPENI I GW I DE P S I E LTQ YLMQKAD I TL - - ATO 192 
EcAdhE FGTITIAEPIGIICGIVPTTNPTSTAIFKSLISLKTRNAIIFSPHPRAKDATNKAADIVLQAAIAAGA^ 194 
EhAdhE YGITEIAEPVGVVCGVTPVTNPTSTAIFKSLISIKTRNPIVFSFH^ 200 

|i 1 = i | . . . lt=> 200 

G^S^= 1 — G — ■ — G^O=^^) — O " r \ \ 

CkSucD GAGRVKAAYSSaRPAYGVGPGNSQVIVDKGYDYNKAAQDIITGRKYDNGIICSSEQSVIAPAEDYDKVIAAFVENGAFYVEDEETVEKFRSTL 283 

CaAdhE GPSLVKSAYSSGKPAIGVGPGNTPVIIDESAHIKMAVSSIILSKTYDNGVICASEQSVIVLKSIYNKVKDEFQERGAYriKKNBLDKVREVIF 285 

CaAad GPSLVKSAYSSGKPAIGVGPGNTPVIIDBSAHIKMAVSSIILSKTYDNGVICASEQSVIVLKSIYNKVKDEFQERGAYriKKNELDKVREVIF 285 

EcAdhE GPGMVKAAYSSGKPAIGVGAGOTPVVIDETADIKRAVASVLMSKTFDNGVICASEQSV 287 

EhAdhE GNAMVKAAYSSGKPALGVGAGNVPTYIEKTCNIKQAANDVVMSKSFDNGMICASEQAAIIDKEIY 300 

2 H . . I ' 3 ' I . . . .300 

CkSucD - FKDGKINSKIIGKSVQIIADLAGVKVPEGTKVIVLKGKGAGBKDVLCKEKMCPVLVALKYDTFEBAVEIA^ 382 

CaAdhE - -KDGSVNPKIVGQSAYTIAAMAGIKVPKTTRILIGEVTSIjGBBEPFAHEKLSPVLAMYBADNFDDALKKAVT^ 376 

CaAad - - KDGSVNPKIVGQSAYTIAAMAGIKVPKTTRILIGEVTSLGEEEPFAHEK^ 376 

EcAdhE - - KNGALNAAI VGQPAYKIAEIjAGFSVPENTKILIGEVTVVDESEPFAHEKLSPTLAMYRAKDFED - 378 

EhAdhE DVNNARLNPKC PGMS PQW F AEQVG I KVPE DCN 1 1 CAVC KE VG PNE PLTRE KLS PVLAI L KABNTQDG I DKAEAMVE FNGRGH S AA I HSND 390 

• • • • * • • . . .400 

jjG=G=Oj 

CkSucD LPISRLWNQPATTAGGTVLPISRLVVNQPATTAGGSFNNGFNPTTTLGCGSWGRNSISENLTYEHLINVSRIGYFNKEAKVPSYBBIWG 472 

CaAdhE RDKIDRFSSAMKTVRTFVNI PTSQGASGDLYNFRI P P S FTLGCGFWGGN S VS ENVGPKHLLNI KTVAERRENML W FRV PH KVY 459 

CaAad RDKIDRFSSAMKTVRTFVNI PTSQGASGDLYNFRI PPS FTLG CG F WGGNS VS ENVG P KHLLNI KTVAERRENML W FRV PH KVY 459 

EcAdhE - - P ARVS Y FGQKM KT AR I L I NT PAS QGGI GDLYN F KLAP S LTLG COS WGGNS I S ENVG P KHL I NKKTVAKRAENMLWHKL P KS I Y 4 61 

EhAdhE - -KAWEKYALTMKACRILHNTPSSQGGIGSIYNY- IWPS FTLGCGS YGGNSVSANVTYHNLLNI KRLADRRNNLQWFRVPPKI F 4 72 

1^=4=9 . . . .490 

B 

Ck4HbD MKLLKLAPDVYKFDTAEEFMKYFKVGKGDPILTNEFLYKPFLEKFNDGADAVFQEKYGLGEPSDEMINNIIKDIGDKQYNRIIAVGGGSVIDIAKILSLK 100 
CaAdhE FKFGCLQFALKDLKDLKKKRAFIVTDSDPYNLNYVDS I I KILEHL- - DIDFKVFNKVG - READLKTI KKATEEMSSFMPDTI I ALGGTPEMSSAKLMWVL 556 
CaAad F K FGCLQFALKDL KDLKKKRAF I VTD S D PYNLNYVDS 1 1 K I LEHL - -DIDFKVFNKVG -READLKTI KKATEEMSSFMPDTI IALGGTPEMSSAKLMWVL 556 
EcAdhE FRRGSLPIALDEVITDGHKRALIVTDRFLFNNGYADQITSVLKAA--GVETEVFFEVE-ADPTL^^ 558 

EhAdhE FEPHSIRYLAELKB LSKIFIVSDRMMYKLGYVDRVMDVLKRRSNBVEIEIFIDVE-PDPSIQTVQKGLAVMNTFGPDNIIAIGGGSAMDAAKIMWLL 568 

•■••■•«... 100 

frD=H=» 

Ck4HbD YTDDSLDLFBGKVPLV - -ICNKELIIVPTTCGTGSEVTNVSVAELKRRHTKKGIASDELYATYAVLVPEFIKGLPYKFFVTSSVDALIHA 187 

CaAdhE YEHPEVKFEDIAIKFMDIRIGUYTFPKLGKKAMLVAITTSAGSGSEVTPFALVTDNNT^^ 656 
CaAad YEHPEVKFEDIAIKFMDIRKRIYTFPKliGKKAMLVAITTSAGSGSEVTPFALVTDNNTC 656 
EcAdhE YEHPETHFEELALRFMDIRKRIYKFPKMGVKAICMIAVTTTSGTGSEVTPFAVVTDDATGQKYPLADYALTPDNIAIVDA^ 658 
EhAdhE YEHPEADFFAMKQKFIDLRKRAFKFPTMGKKARLICIPTTSGTGSEVTPFAVISDHETCKKYPIJ^ 668 

P ■ 1= 20 0 

=EtJ |pG=H==AH C PHGii 

Ck4HbD TBAYVSPNANPYTDMFSVKAMELILNGYMQMVEKGNDYRVEIIBDFVIGSNYAGIAFGNAGVGAVH^ 277 
CaAdhE IEAYTSVYASEYTNGLiALEAIRLIFK-YLPEAYKNGRTNEKAREKMAHASTMAGMASA 755 
CaAad IBAYTSVYASEYTNGLALEAIRLIFK-YLPEAYKNGRTNEKAREKMAHASTMAGMASANAFl^ 755 
EcAdhE MEAYVSVIASEFSDGQALQALKLLKE-YLPASYHEGSKNPVARERVHSAATIAGIAFANAFLGVCH^ 757 
EhAdhE TBAYVSVMANEYTDGLAREAVKLVFE -NLLKSYNG- - - DLEAREKMHNAATIAGMAFASAFIX3MDHSMAHKVGAAFHLPHGRCVAVLLPHVIRYNG-QKP 763 

=3 ...... 1 * 1 2 » ') . . 300 

Ck4HbD" NGKIKDVNKIiLAGILKCDESEAYDSLSQLLDKLLSRKPLRBYGMKEEEIETFADSVIEGQQRLLVNNYEP 347 

CaAdhE VKQAPCPQYKYPNTIFRYARIADYIKLG- - GNTDE E KVDLL I NKI H ELKKALN I PTS I KD AGVLEENFY S SLDR I S ELALDDQCTGAN PR F PLTSB I KEM 846 

CaAad VKQAPCPQYKYPNTIFRYARIADYIKLG- - GNTDE E KVDLL I NK I H EL KKALN I P.TS I KD AGVLE ENFY S SLDR I SELALDDQCTG AN PR F PLTSE I KEM 853 

EcAdhE TKQTAFSQYDRPQARRRYAEIADHWSLSAPGDRTAAKIEKLIjAWLETLKAELGIPKSIREAGVQEADPIJuWDKLSE 857 

EhAdhE RKLAMWPKYNFYKADQRYMELAQMVGLKC- -NTPAEGVEAFAKACEELMKATBTITGFKKANIDEAAWMSKVPEMALLAFEDQCSPANPRVPMVKDMEKI 858 

.......... 400 

Ck4HbD FSREDIVNTYKKLY 371 

CaAdhE YIN CP - KKQP 862 

CaAad YINFVLKNNLKPSYF -NYFL 872 

EcAdhE LLDTYYGRDYVEGETAAKKEAAPAKAEKKAKKSA 891 

EhAdhE LKAAYYP - I A 870 

924 

FIG. 6. Amino acid alignment of the sucD and 4hbD gene products from C. kluyveri (CkSucD [A] and Ck4HbD (B), respectively) with the adhE gene products from 
C. acctobutylicum (CaAdhE [16] and CaAad (35]), E. coli (EcAdhE [20]), and E. histolytica (EhAdhE [57]). The C. kluyveri SucD and 4HbD proteins are depicted in 
the first line and cover the positions 1 to 472 and 1 to 371, as indicated. (A) The first site indicates a conserved dodecapeptide for aldehyde dehydrogenases (24). 
Glycine-ricb motifs (sites 2 and 4) indicate putative nucleotide binding sites of the aldehyde dehydrogenase domain. Site 3 depicts the proposed active center of the 
aldehyde dehydrogenase domain with a conserved cysteine residue. (B) Sites 1 and 2 represent the proposed iron binding motif for the class III alcohol dehydrogenases 
(2, 61) comprising several histidinc residues. 
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CkCatl MSKGIKNSQLKKJQ^KASNVAEKIESKVEKTDKVVEKAAEVTEKRIRNLKI^EKVW 99 

NcAcuS MAS PI ASAALRARQGPSMLKKLCNPSDMLQHFPNGAYIGWSGFTASATPRR- SYYLADHVEKNASRP 66 

ScAchl MT--I SNLLKQRVRYAPYLKKVKEAHELI PLFKNGQYLGWSGFTGVGTPKAVPEALIDHVEKNNLQG 6 5 

»>*•«•««•« 100 

CkCatl FKVTLYTGSSTGADIDGEWAKAGIIERRIPYQTNSDMRKKIira VDI AZ I EAVAITEEGD 1 1 PSTGI 189 

NcAcu8 AQVQPLRRCLRRRRDRERWAALDMIAHDRRRAPHQVGKNIAKGIN^^ 165 

ScAchl KLRFNLFVGAS AGPEENRWAEHDMI I KRAPHQVGKPIAKAINQGRIEFFDKHLSMFPQDLTYGFYTRERKDNKILDYTIIEATAIKEDGSIVPGPSV 162 

• • • • « • • . . .200 



CkCatl GirTATPVBNADKVIVEINEAQPLELEGMADIYTLKNPPRREPIPIVNAGN^ PVSQAISDNLIGFLNKE 289 

NcAcuS GATPELIQMADKVI IEVNTAI P- SFDGIJIDITFSDLPPNRKPYIjIG^CRDRIGTTSVPVDPEKVVGI IECTTPDQTLPNSPADETATAIAGHLIEFFEHE 264 
ScAchl GGSPEFITVSDKVIIEVNTATP-SFEGIHDIDMPVNPPFIIKPYPYLKVDD^ 261 

* • • « • • • .300 



CkCatl VEEGKLPKNLLPIQSGVGSVANAVLAGLCESNFKNLSCXTEVIQ^ 389 

NCACU8 VAHGRLPKNLLPI^SGIGNIANAVIGGLETSNPKNLNVWTEVIQDTFLDLFDSGKLDFATATSIRFSPTGFERFYK2JTO 364 

ScAchl VKHGRLPENLLPLQSaiONIANAVIEGLAGAQFKHLTVWTEVI^DSLLDLFENGSLDYSTATSVRLTEKGFDPJa^ANWENFra 361 

P • . • • . . 400 



CkCatl RIGVISINTALEVDIYGNVNSTHVMGSKMMNGIGGSGDFARNAYLTIFTTESI AK KGD I S S I V PMVS H VDHTEHDVMV I VTEQGVADLRGLS PREK 485 

NcAcuS RI/3V I GMNT P VE VDI YAHANS TKVMGS RMLNGLGG S ADFLRNSKYS I MHT PSTR PS KTDAHG VS C I V PMCTHVDQTEHDLDVI VT ENGLADVRGLS PRER 464 
ScAchl RFPVIAMNTPVEVDI YAHANS TNVNGS RMLNGLGG SADFLRNAKLS I MHAPSARPTKVDPTG I STIVPMASHVDQTEHDLDILVTDQGLADLRGLSPKER 461 

. p ... .... 500 

CkCatl AVAIIENCVHPDYKDMLMEYFEBAC KSSGGNTPHNLEKALSWHTKFIKTGSMK 53 8 

NcAcuS ARV 1 1 DKCAHD VY KP I L KAY F E KAE FECLR KGMGH E PHLL FNS FDMHKALVEEG SMAKVK F 52 5 

ScAchl AREIINKCAHPDYQALLTDYLDRAEHYAKKHNCLHEPHMLR1JAFKFHTNLAEKGTMKVDSWEPVD 526 

• • • • • >565 

FIG. 7, Amino acid alignment of the C. kluyveri succiny!-CoA:CoA transferase (CkCatl) with a protein necessary for acetate utilization in N. crassa (NcAcuS) and 
with an acctyl-CoA hydrolase from S. cercvisiae (ScAchl). Blocks indicate putative sites for nucleotide (CoA) binding. 



strictly conserved among all class III alcohol dehydrogenases 
and might also be involved in metal binding. It is evident that 
the clostridial 4HbD enzyme requires divalent cations for ac- 
tivity; our assay includes Mg 2+ (1 mM) as described elsewhere 
(22, 49). However, the ion specificity of the enzyme has not yet 
been investigated. 

The E. coli AdhE is a homodimeric protein composed of two 
96-kDa subunits (9). Interestingly, an AdhE-like aldehyde- 
alcohol dehydrogenase enzyme complex has also been purified 
and characterized from C. kluyveri, which is composed of two 
55- and two 42-kDa subunits (33). From the data presented 
here, a gene arrangement similar to sucD-4hbD might be ex- 
pected. Cloning and sequencing of the corresponding genes 
could be an important step for the elucidation of evolutionary 
relationships within this class of enzymes. A dimeric aldehyde 
dehydrogenase domain also corresponds to the composition of 
several CoA-acylating enzymes: the succinate-semialdehyde 
dehydrogenase from C. kluyveri (2 X 55 kDa [49]), the butyr- 
aldehyde dehydrogenase from C. acetobutylicum (2 X 55 kDa 
[36]), and the aldehyde dehydrogenase from Clostridium bei- 
jerinckii (2 X 56 kDa [56]). 

During growth on ethanol and succinate, a succinyl-CoA: 
CoA transferase reaction is required for the initial activation of 
the substrate (49). Heterologous expression of the clostridial 
catl gene in E. coli from the pBluescript lac promoter identi- 
fied its gene product as a succinyl-CoA:CoA transferase. 
Northern blot analysis revealed that catl is efficiently tran- 
scribed only during growth on ethanol plus succinate. We 
could not detect sequence similarities between Catl and the 
heterodimeric CoA transferases from Pseudomonas putida, 
Acinetobacter calcoaceticus, and C acetobutylicum or the mo- 
nomeric enzyme from pig heart, which is presumably cleaved 
after translation (18, 32, 37). It is somewhat surprising that the 
C. kluyveri succinyl-CoA:CoA transferase (Catl) and the 
Acu-8 protein sequence from N. crassa indicate high homolo- 
gies to an acetyl-CoA hydrolase from S. cerevisiae (Fig. 7). 



However, the conserved CoA (ADP) binding site from the 
heterodimeric CoA transferases (-G-x-G-x-x-G- [37]) is also 
abundant (-G-x-G-x-x-G/A-) in the amino acid sequences of 
Achl (S. cerevisiae), Acu-8 (A/, crassa), and Catl (C. kluyveri 
[Fig, 7]). As in C kluyveri, growth of N. crassa on acetate 
should not involve an acetyl-CoA hydrolase, but a succinyl- 
CoA:CoA transferase, which catalyzes the activation of ace- 
tate, might be required. 

Northern blot analysis revealed that the cloned gene region 
is efficiently transcribed during growth on ethanol plus succi- 
nate (Fig. 5), The signal of 2,700 nucleotides obtained with the 
4fcM)-complementary probe most likely represents a common 
transcription oisucD and 4hbD, which together are 2,590 bp in 
length. The presence of a rho-independent terminator and an 
additional stem-loop downstream of the 4hbD gene further 
supports this hypothesis. The signals of 5,500 and 9,500 nucle- 
otides might represent a common transcription of orfY, catl, 
sucD, and 4hbD (which together cover 5,042 bp) and point to 
an additional transcription start point upstream of the cloned 
gene region. Because of the high A+T content of the clostrid- 
ial DNA, especially in the intergenic regions, identification of 
putative promoter structures by sequence comparison alone is 
often ambiguous. However, a more or less significant motif 
(TGGTAG-21 bp-TATAAT) is located 44 bp upstream of 
sucD. Clearly, more experiments (including primer extension) 
are needed for a detailed transcription analysis. 

The data on heterologous expression of catl, sucD, and 
4hbD are in good agreement with the results from the work of 
Wolff et al. (55), who recently reported on several protein 
bands in cell extracts from C. kluyveri, which occurred only 
during growth on ethanol plus succinate (100, 66, 58, 55, 37, 
and 30 kDa). The 30-kDa protein band would correspond to 
orfY, but it could not be identified during expression of the 
cloned genes in E. coli, probably because of poor translation 
initiation. 
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